( 



jam 0 4 7nn?' 

UNITED STATES DISTRICT COURT * ' uu *j£>J 
SOUTHERN DISTRICT OF TEXAS NcMfcHbcOrt 
HOUSTON DIVISION 



CLONTECH LABORATORIES, INC., § 

I H 02 -0017 

v. § Civil Action No, ____ 

§ 

BCM TECHNOLOGIES, INC § 
and BAYLOR COLLEGE OF MEDICINE, § 

§ 
§ 

Defendants. § 

- § 



COMPLAINT FOR DECLARATORY JUDGMENT 

Clontech Laboratories, Inc. files this Complaint for Declaratory Judgment and states as 
follows: 

Subject Matter Jurisdiction 
L This is an action for declaratory judgment arising under Title 35 of the United 
States Code and the declaratory judgment laws of the United States, 28 U.S.C. § 2201, et seq. 
Exclusive subject matter jurisdiction over this matter is conferred upon the Court pursuant to 28 
U.S.C. §§ 1331, 1338(a) and 1367 in that this is a civil action involving claims arising under the 
laws of the United States and wherein all other claims are so related to claims within this Court's 
jurisdiction that they form a part of the same case or controversy. 
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The Parties 

2. Plaintiff Cloritech Laboratories, Inc. ("Clontech") is a coiporation organized 
under the laws of the state of Delaware and having its principal place of business at 1020 East 
Meadow Circle, Palo Alto, California, 94303, 

3. On information and belief, Defendant BCM Technologies, Inc. is a corporation 
organized under the laws of the state of Delaware and having its principal place of business at 
1 709 Dryden Road, Suite 900, Houston, Texas 77030 ("BCMT*). 

4. On information and belief, Defendant Baylor College of Medicine is a nonprofit 
coiporation organized and existing under the laws of the State of Texas and having a principal 
place of business at One Baylor Plaza, Houston, Texas 77030 ( iC Baylor"). 

Venue and Personal Jurisdiction 

5. Venue is proper in this district pursuant to 28 U.S.C. §§ 1391(b)-(c). Defendant 
Baylor, a Texas nonprofit corporation and Defendant BCMT, a corporation having its principal 
place of business in Houston, Texas, have accused Plaintiff of using its patented technology and 
threatened to sue Plaintiff, a Delaware corporation doing business in the State of Texas, for 
patent infringement. Venue for such a suit is proper in this district and in Texas given that both 
corporate Defendants reside therein. 



COMPLAINT FOR DECLARATORY JUDGMENT - Page 2 



dl51520.1 




The Patent-in-Suit 



6. On information and belief, Baylor is the assignee of United States Patent No. 
5,851,808 to Stephen J. Elledge and Qinghua Liu, entitled RAPID SUBCLONING USING SITE- 
SPECIFIC RECOMBINATION, and which issued on December 22, 1998 ("the '808 patent"). 
(A copy of the '808 patent is appended hereto as Exhibit A). 

Case or Controversy 

7. An immediate, real, and justiciable controversy exists between the parties to this 
action. As set forth in paragraphs 8-19 below, Baylor and BCMT have threatened Clontech in a 
manner which has caused Clontech to have a reasonable apprehension that the Defendants will 
sue Clontech for the alleged infringement of the '808 patent through its past offerings for sale 
and continuing sales of its Creator™ products and products which depend upon or include the 
Creator™ technology. 

8. In April of 1998, Plaintiff Clontech entered into licensing negotiations with 
Baylor with respect to certain technology covered by the '808 patent ( <4 the Elledge technology")- 
As part of those negotiations, it was necessary for Clontech to receive various materials and 
information from Baylor and BCMT. 

9. Clontech agreed to use any information and/or materials it received from BCMT 
for purposes of evaluating its possible scientific and technical merits and its possible commercial 
potential. 

10. Between July 1998 and December 1998, Clontech and Baylor unsuccessfully 
attempted to negotiate terms for a license to the technology covered by the '808 patent. 
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11. On December 1 7, 1 998, Baylor informed Clontech that the technology covered by 
the '808 patent would be licensed to another company and not to Clontech. (The '808 patent 
then issued on December 22, 1998). 

12. Clontech, upon receiving notice from Baylor, that they would not be granted a 
license to the '808 patent's technology, continued its prior efforts towards the independent 
development of its own proprietary rapid subcloning technology. These efforts resulted in the 
development of a novel methodology which Clontech commercialized as its Creator™ products 
and technology. Clontech' s developmental efforts did not incorporate or otherwise include any 
information or materials received from Baylor and/or BCMT. 

13. In 1999, as a result of its independent development efforts, Clontech 
commercially launched its Creator™ system for rapid subcloning of DNA. Clontech has also 
filed a U.S. Patent Application directed to this proprietary version of rapid subcloning 
technology. Clontech's developmental work and the achievement of the novel and proprietary 
Creator™ product(s) was accomplished independently of any information or materials received 
from Baylor and/or BCMT. 

14. On or about December 5, 2001, Baylor and BCMT filed a Texas state court action 
in the 133 rd Judicial District Court of Harris County, Texas, Cause No. 2001-61352 ("the State 
Action"), against Clontech for various causes of action which are factually and legally 
intertwined with the allegations of this Complaint The State Action allegations include breach 
of contract, misappropriation of trade secrets, statutory theft of trade secrets, breach of fiduciary 
relationship and conversion. (A copy of the Petition is attached hereto as Exhibit B). 
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15. In the State Action, Baylor has alleged that certain biological materials, including 
specific plasmids and cell lines (which Baylor defined therein as the "Elledge trade secrets"), 
were employed by Clontech in the development of Clontech's Creator™ technology. 

16. Baylor and BCMT have specifically alleged that; "Clontech's Creator system is 
based upon and makes use of the Elledge trade secrets, including Elledge technology, and will 
continue to use the trade secrets, including the Elledge technology, thereby causing irreparable 
harm to Baylor and BCMT." (Exhibit B at ^ 16). 

17. Baylor and BCMT have alleged the Elledge technology is covered by the '808 
patent and that Clontech has made use of and continues to use the Elledge technology. As such, 
Baylor and BCMT have accused Clontech of infringing the '808 patent 

18. In its Prayer for Relief in the State Action, Baylor and BCMT seek several forms 
of relief which are normally available under the federal patent laws of Title 35 of the United 
States Code and the applicable Federal Circuit precedent Such forms of relief include 
injunction, attorneys' fees, and actual and treble damages as specified in the federal patent laws 
at 35 U.S.C. §§ 283-285. (Cf. Exhibit B at pp. 9-1 1). 

19. Further, representatives of Baylor have publicly stated that Baylor intends to sue 
Clontech for infringement of the '808 patent 

20. As demonstrated in the foregoing paragraphs, Clontech has a reasonable 
apprehension that, as demonstrated by and in addition to the State Action, it will be sued for 
patent infringement by Baylor and BCMT based upon its marketing and sales of its proprietary 
Creator™ products. Hence, an immediate, real, and justiciable controversy exists between the 
parties which gives rise to this declaratory judgment action. 
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Allegation of Non-Infringement 

21. Clontech incorporates by reference the specific allegations of paragraphs 1-20. 

22. Clontech has not infringed, and does not infringe, any valid claim of the '808 

patent. 

23. Clontech neither used the Elledge information or materials in its Creator™ 
technology nor did it base its Creator™ technology on any materials or information it received 
from Baylor or BCMT. 

24. The development of Clontech's Creator™ technology was entirely independent of 
Clontech's interaction with Baylor and without any benefit of the Elledge information, 
technology or materials, and the Creator™ products do not fall within the scope of any of the 
'808 patent claims. 

Allegation of Invalidity 

25. Clontech incorporates by reference the specific allegations of paragraphs 1-24. 

26. On information and belief, the claims of the '808 patent are invalid under the 
patent laws of the United States, as codified in Title 35 of the United States Code including, but 
not limited to, §§ 102, 103 and 112. 

27. On information and belief, the claims of the '808 patent are invalid under 35 
U.S.C. § 1 12, H 1 for, inter alia, failure to comply with the best mode requirement Baylor and 
BCMT have admitted that it maintains information concerning the Elledge technology as a trade 
secret thereby violating its obligations under 35 U.S.C. § 1 12, 1 1 . 
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28. On information and belief, the claims of the '808 patent are invalid as anticipated, 
derived from and/or obvious under 35 U.S.C. § 102/103 over the prior art. 

Request for Relief 

29. Clontech respectfully requests: 

a. a judgment declaring that Plaintiff has not infringed, and is not presently 
infringing, the '808 patent; 

b. a judgment declaring that the claims of the '808 patent are invalid; 

c. a judgment awarding its attorney fees pursuant to 35 U.S.C. § 285; 
~d. ajudgment awarding its costs (Fed. R. Civ. P. 54(d)); and 

e. any other relief the Court may deem appropriate under the circumstances. 

Dated: January 4, 2002. 



COMPLAINT FOR DECLARATORY JUDGMENT - Page 7 



dl51520.1 



V 



Respectfully submitted, 




Texas State Bar No. 09050800 
Southern District of Texas Id. No. 2171 
Jules Elese Angelley 
Texas State Bar No. 00793347 
Southern District of Texas Id. No. 19862 

1717 Main Street 

Suite 4600 

Dallas, Texas 75201-4605 
Telephone: (214) 743-4500 
Telecopy: (214)743-4545 

Of Counsel : 

Marc R. Labgold, Ph.D. 
Richard J. Oparil 
Kevin M. Bell 
Raymond Millien 
1200 19th Street, N.W. - 
Washington, D.C. 20036 
Telephone: (202) 861-3900 
Telecopy: (202)223-2085 

ATTORNEYS FOR DEFENDANT 
CLONTECH LABORATORIES, INC. 
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ABSTRACT 



The present invention provides compositions, including 
vectors, and methods for the rapid subcloning of nucleic acid 
sequences in vivo and in vitro. In particular, the invention 
provides vectors used to contain a gene of interest that 
comprise a sequence -specific recocibinase target site. These 
vectors are used to rapidly transfer the gene of interest into 
any expression vector thai contains a sequence-specific 
rccombinasc target site located downstream of a promoter 
element so that the gene of interest may be expressed. 
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UAl'ID SUBOUON1NC US1NC SITE-SI'ECIFIC 
RECOMBINATION 

FIELD OF *J*HE INVENTION 

The invention relates to recombinant D NA technology. In 
particular, Ihc invention relates to compositions, including 
vectors, and methods for the rapid subcloning of nucleic acid 
sequences in vivo and in vitro. 

BACKGROUND OF THE INVENTION 

Molecular biotechnology has revolutionized the produc- 
tioD of protein and polypeptide compounds of pharmaco- 
logical importance. The advent of recombinant DNA tech- 
nology permitted for the first time the production of proteins 
on a large scale in a recombinant host cell rather than by the 
laborious and expensive isolation of the protein from tissues 
which may contain .minute quantities of the desired protein 
(e.g., isolation of human growth hormone from cadaver 
pituitary). The production of proteins, including human 
proteins, on a large scale in a heterologous host requires the 
ability let express the protein of interest in the heterologous 
host. 'In is process typically involves isolation or cloning of 
the gene encoding the protein of interest followed by trans- 
fer of the coding region into an expression vector which 
contains elements (e.g., promoters) which direct the expres- 
sion of the desired protein in the heterologous host celL The 
most commonly used means of transferring or suhcloning a 
coding region into an expression vector involves the in vitro 
use of restriction endonucleases and DNA ligases. Restric- 
tion cndonuclcascs arc enzymes which generally recognize 
and cleave a specific DNA sequence in a double-stranded 
DNA molecule. Restriction enzymes are used to excise the 
coding region from the cloning vector and the excised DNA 
fragment is then joined using UNA ligasc to a suitably 
cleaved expression vector in such a manner that a functional 
protein may be expressed. 

'Ihc ability to transfer the desired coding region to an 
expression vector is often limited by the availability or 
sui Lability of restriction cn/ymc recognition sites. Often 
multiple restriction enzymes must be employed for the 
removal of the desired coding region and the reaction 
conditions used for each en/yme may differ such that it is 
necessary to perform the excision reactions in separate steps. 
In addition, it may be necessary to remove a particular 
enzyme used in an initial restriction enzyme reaction prior to 
completing all restriction enzyme digestions; this requires a 
time-consuming purification of the subcloning intermediate. 
Ideal methods tor the subcloning of DNA molecules would 
permit the rapid transfer of the target DNA molecule from 
one vector to another in vitro or in vivo without the need to 
rely upon restriction enzyme digestions. 

SUMMARY OF THE IN VENTION 

The present invention provides reagents and methods 
which comprise a system for the rapid subcloning of nucleic 
acid sequences in vivo and in vitro without the need to use 
restriction enzymes. Accordingly, the present invention pro- 
vides a nucleic acid construct comprising, in operable order: 
a) a conditional origin of replication; b) a sequence-specific 
recombinase target site having a 5' and a 3' end; and c) a 
unique restriction enzyme site, the restriction enzyme site 
located adjacent to the 3' end of the sequencc-spcciiic 
recombinase target site. The present invention is not limited 
by the nature of the conditional origin of replication 
employed. A variety of conditional origins of replication, 
including tempera hire -sensitive replicons are known to the 
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art and may be employed on the nucleic acid constructs of 
the present invention. In a preferred embodiment, the 
nucleic acid construct further comprises a prokaryotic ter- 
mination sequence. 'I be present invention is not limited by 

5 the nature of the prokaryotic termination sequence chosen. 
In one embodiment, the prokaryotic termination sequence is 
the T7 termination sequence. A variety of termination 
sequences are known to the art and may be employed in the 
nucleic acid constructs of the present invention including, 

10 the Tjjm Txj t T x2 , 1^3, TR A> TR^, T^. termination signals 
derived from the bacteriophage lambda [Lambda II, Ilendrix 
cl al. Eds., Cold Spring Harbor Press, Cold Spring Harbor, 
N.Y. (1983)] and termination signals derived from bacterial 
genes such as the trp gene of A', coti [Slryer, iiitxzhcmisiry, 

15 2nd ccL, W. H. Freeman and Co., San Francisco, Calif. 
(1981), p. 610]. 

In another preferred embodiment, the nucleic acid con- 
struct further comprises a cukaryolic polyadenylation 
sequence. The present invention is not limited by the nature 

2U of the eukaryottc polyadenylation sequence chosen. In one 
embodiment, the eukaryolic polyadenylation sequence is 
selected from the group consisting of the bovine growth 
hormone polyadenylation sequence, the simian virus 40 
polyadenylation sequence and the Ilcrpcs simplex virus 

25 thymidine kinase polyadenylation sequence. In yet another 
preferred embodiment, the nucleic acid construct further 
comprises a selectable marker gene, 'ihc present invention is 
not limited by the nature of the selectable marker gene 
chosen; the selectable marker may be a positive or negative 

3" selectable marker. In a preferred embodiment, the selectable 
marker is selected from the group consisting of the kana- 
mycin resistance gene, the ampicillin resistance gene, the 
tetracycline resistance gene, the chloramphenicol resistance 
gene, the streptomycin resistance gene, the strA gene and the 

35 sacB gene. 

'I "he present invention is not limited by the nature of the 
sequence -specific recombinase target site employed on the 
nucleic acid construct. In one embodiment, the sequence- 
Specific recombinase target site is selected from the group 

40 consisting of loxP, loP2, loxP3, loxP23, loxP511, loxB, 
loxC2, loxL, loxR, loxA86, loxA117,frt, dif, and att. 

In a preferred embodiment, the nucleic construct further 
comprises a gene of interest inserted into the unique restric- 

45 tion enzyme site. 

In a preferred embodiment, the nucleic acid construct has 
the nucleotide sequence set forth in SCO ID NO:l (i-e., 
pUNI-10). 

The present invention further provides a nucleic acid 
so construct comprising, in order 5* to 3': a) a conditional origin 
of replication; b) a sequence-specific recombinase target 
site; c) a polylinkcr; and d) a selectable marker gene. The 
present invention is not limited by the nature of the 
polylinkcr employed. Any collection of two or more reslric- 
55 tion enzyme sites which arc unique to the nucleic acid 
construct may be employed as the polylinkcr. The present 
invention is not limited by the nature of the conditional 
origin of replication employed. A variety of conditional 
origins of replication, including temperature-sensitive rep- 
en limns are known to the art and may be employed on the 
nucleic acid constructs of the present invention. In a pre- 
ferred embodiment, the nucleic acid construct further com- 
prises a prokaryotic termination sequence. The present 
invention is not limited by the nature of the prokaryotic 
65 termination sequence chosen. In one embodiment, the 
prokaryotic tcrnaination sequence is the 17 termination 
sequence. In another preferred embodiment, the nucleic acid 
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construe!, further comprises a cukaryolic polyadenylation 
sequence. The present invention is not limited by the nature 
of the euJca ryot ic polyadenylation sequence chosen. In one 
embodiment, the cukaryolic polyadenylation sequence is 
selected from the group consisting, of the bovine growth 
hormone polyadenylation sequence, the simian virus 40 
poly adenyla tion sequence and the Ilcrpes simplex virus 
thymidine kinase polyadenylation sequence. In yet another 
preferred embodiment, the nucleic acid construct further 



rccombinasc target site having a 5* and a 3' end, wherein the 
3' end of the promoter element is located upstream of the 5* 
end of the a sequence-specific recombinase target site; c) a 
gene of interest joined to the 3' end of the sequence-specific 
recombinase target site such that a functional translational 
reading frame is created; d) a conditional origin of replica- 
tion; e) a first selectable marker gene; f) a second sequence- 
specific recombinase target site; and g) an origin of repli- 
cation. In a preferred embodiment, the first and the second 



comprises a selectable marker gene. The present invention is 10 sequence-specific recombinase target site have 'the same 

not limited by the nature of the selectable marker gene nucleotide sequence (e.g., both comprise a loxP site). The 

chosen; the selectable marker may be a positive or negative first and the second sequence -specific recombinase target 

selectable marker. In a preferred embodiment, the selectable site need not have I be same nucleotide sequence provided 

marker is selected from the group consisting of the kana- the different sites can recombine with one another. 'I Tie 

mycin resistance gene, the ampicillin resistance gene, the 15 present invention is not limited by the nature or cither the 

tetracycline resistance gene, the chloramphenicol resistance conditional origin of replication or the no n -conditional ori- 

gene, the streptomycin resistance gene, the strA gene and the gin of replication employed. 

saeH gene. I n a preferred embodiment, the nucleic acid construct 

The present invention is not limited by the nature of the further comprises a second selectable marker gene. Prefer- 

seque nee-specific recombinase target site employed on the 20 a h!y the first and the second selectable marker genes are 

nucleic acid construct. In one embodiment, the sequence- different selectable marker genes. The present invention is 

specific recombinase target site is selected from the group not limited by the nature of the selectable marker genes 

consisting of loxP, loxP2, loxF3, loxP23, loxPSll, loxB, chosen; the selectable marker genes may be positive or 

loxC2, loxL, loxR, loxA86, loxA117,frt, dif, and art. negative selectable marker genes. As discussed above, the 

In a preferred embodiment, the nucleic construct further 25 present invention is not limited by the nature of the promoter 



comprises a gene of interest inserted into the polylinker. 

The present invention also provides a nucleic acid con- 
struct comprising in operable order a) an origin of replica- 
tion; b) a promoter element having a 5' and a 3 end; and c) 3(J 
a sequence -specific recombinase target site having a 5* and 
a 3* end. In a preferred embodiment the 3* end of the 
promoter element is located upstream of the 5' end of said a 
sequence-specific recombinase target site. The present 
invention is not limited by the nature of the origin of 35 
replication employed. A variety of nan-conditional origins 
of replication are known to the art and may be employed on 
the nucleic acid constructs of the present invention. The 
invention is not limited by the nature of the promoter 
clement employed. 'iTiose skilled in the art know that the ^ 
choice of the promoter element depends upon the type of 
host cell to be employed for expressing a gene(s) under the 
transcriptional control of the chosen promoter element. A 
wide variety of promoter elements functional in prokaryotic 
(e.g., E. colt) and cukaryotic (e.g., yeast, insect, marimals 4S 
inchiding humans) cells are known to the art and may be 
employed in the nucleic acid constructs of the present 
invention. 

In another preferred embodiment, the nucleic acid con- 
struct further comprises a selectable marker gene. The 50 
present invention is not limited by the nature of the select- 
able marker gene chosen; the selectable marker may be a 
positive or negative selectable marker. In u preferred 
embodiment, the selectable marker is selected from the 
group consisting of the kanamycin resistance gene, the 55 
ampicillin resistance gene, the tetracycline resistance gene, 
the chloramphenicol resistance gene, the streptomycin resis- 
tance gene, the strA gene and the sacD gene. The present 
invention is not limited by the nature of the sequence- 
specific recombinase target site employed 00 the nucleic 6( , 
acid construct. In one embodiment, the sequence-specific 
recombinase target site is selected from the group consisting 
of loxP, loxP2, loxP3, LoxP23, loxPSll, loxB, loxC2, loxL, 
loxR, loxA.86, loxA117, frt, dif and att. 



clement nor the nature of the sequence-specific recombinase 
target sites chosen. 

The present invention also provides a method for the in 
vitro recombination of nucleic acid constructs, comprising: 
a) providing: i) a first nucleic acid construct comprising, in 
operable order, a conditional origin of replication, a first 
sequence -specific rccombinasc target site and a first select- 
able marker gene; ii) a second nucleic acid construct 
comprising, in operable order, an origin of replication, a 
promoter element and a second sequence-specific recombi- 
nase target site; and iii) a site-specific recombinase; b) 
contacting the first and the second constructs in vitro with 
the site-specific recombinase under conditions such that the 
first and second constructs are recombined to form a third 
nucleic acid construct. 

In a preferred embodiment, the first construct employed in 
the method further comprises a gene of interest and the 
recombination of the first and second constructs places the 
gene of interest under the transcriptional control of the 
promoter element. In another preferred embodiment, the 
second construct further comprises a nucleotide sequence 
encoding an affinity domain and the recombination of the 
first and second constructs results in placing the gene of 
interest in frame with the sequence encoding the affinity 
domain, 'trie present invention is not limited by the nature of 
the affinity domain employed; a variety of suitable affinity 
domains are known to the art including glutathione-S- 
Iransfcrasc, the maltose binding protein, protein A, polyhis- 
tidine tracts, etc. 

In a preferred embodiment, the second construct further 
comprises a second selectable marker gene, the second 
selectable marker being different from the first selectable 
marker (present on the first construct). The present invention 
is not limited by the nature of the selectable marker genes 
chosen; the selectable marker genes may be positive or 
negative selectable marker genes. As discussed above, the 
present invention is not limited by the nature of the promoter 
element nor the nature of the sequence -specific recombinase 



The present invention further provides a nucleic acid 65 Urg^i si^s chosen, 
construct comprising in operable order: a) a promoter ele- The present invention further provides a method for the 

ment having a 5' and a 3 end; b) a first sequence-specific recombination of nucleic acid constructs in a prokaryotic 
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host, comprising: a) providing: i) a first nucleic acid con- 
struct comprising a conditional origin of replication, a first 
sequence-specific recombinase target site having a 5' and a 
y end, a unique restriction enzyme site, the restriction 
enzyme site located adjacent to the 3* cad of the sequence- s 
specific recombinase target site, a first selectable marker 
gene and a cukaryotic polyadenylation sequence; if) a sec- 
ond nucleic acid construct comprising an origin of 
replication, a promoter clement having a 5' and a 3* end and 
a second sequence-specific recombinase target site having a 10 
5' and a 3* end; in) a procaryotic host cell expressing a 
site -specific recombinase; b) introducing the first and said 
second *DNA constructs into the prokaryotic host cell under 
conditions such that the first and second I >N A constructs are 
rccombincd Lc> form a third nucleic acid construct capable of 15 
replicating in the host cell. 

in a preferred embodiment, the 3' end of the promoter 
clement is located upstream of the 5' end of the second 
sequence-specific recombinase target site. 

In another preferred embodiment, the method further 20 
comprises growing the host cell containing the third con- 
struct under conditions which select for the presence of the 
third construct. 

In a preferred embodiment, the first construct further 
comprises a prokaryotic termination sequence. The present 25 
invention is not limited by the nature of the pmkaryolic 
termination sequence chosen. In one embodiment, the 
prokaryotic termination sequence is the T7 termination 
sequence. 

In another preferred embodiment, the first construct fur- 
ther comprises a gene of interest inserted into the unique 
restriction endo nuclease recognition site. 

The present invention also provides a fusion protein 
comprising protein sequences derived from a glutathione- 35 
S-Uansfcrasc (Gsl) protein and a Crc protein; the Gst-Crc 
fusion protein may be provided in a purified form.' In a 
preferred embodiment, the Cisl-Crc fusion protein has the 
amino acid sequence of SEQ ID NO: 11. In another preferred 
embodiment, the Gst-Crc fusion protein is encoded by the ^ 
nucleic acid sequence of SEQ TD NO:10. 

DESCRIPTION OF THE DRAWINGS 

FIG. 1 provides a schematic illustrating certain elements 
of the pUNI vectors and the Univcctor Fusion System. 

FIG. 2A provides a schematic map of the pUNI-10 vector; 45 
the locations of selected restriction enzyme sites are indi- 
cated and unique sites are indicated by the use of bold type. 

FIG. 2B shows the DNA sequence of the loxP site and the 
poly linkers contained within pUNI-10 (i.c., nucleotides 
401-530 of SEQ ID NO:l). 

FIG. 3 A shows the olign nucleotides (SEQ TD NOS:4 and 
5) which were annealed to insert a kixP site into the 
polylinker of pGEX-2TKes to create pgsl-lox. 

FIG. 3B provides a schematic map of pOEX-2TKcs 53 
which includes an enlargement of the multiple cloning site 
(MCS). 

FIG. 4A shows the oligonucleotides (SEQ ID NOS:6 and 
7) which were annealed to insert a loxP site into the 
polylinker of pVL1392 to create pVL1392-lox. 6(l 

FIG. 4B provides a schematic map of pVl/1392 which 
includes an enlargement of the multiple cloning site (MCS); 
the ampiciUin resistance gene (Ap®) and the tac promoter 
are indicated. 

FIG. 5 A shows the oligonucleotides (SEQ ID NOS:8 and 65 
*>) which were annealed to insert a loxT site into the 
polylinker of pGAP24 to create pGAP24-lox. 
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FIG. SB provides a schematic map of pGAI*24 which 
includes an enlargement of the multiple cloning site (MCS); 
the ampiciUin resistance gene (Ap®), the GAP promoter 
(Ff7Ap)> lh<s origin from the 2 fan circle (2ft} and the 'VRPl 
gene, encoding N-(5*-phosphoribosyl)-anthraniUte 
synthetase, (TRP1) are indicated. 

FIG. 6 A shows the oligonucleotides (SEQ ID NOS:8 and 
9) which were annealed to insert a loxP site into the 
polylinker of pG AL14 to create pG AL14-lox. 

FIG. 6R provides a schematic map of pGAf .14 which 
includes an enlargement of the multiple cloning site (MCS); 
the ampiciUin resistance gene (Ap<5>), the GAL promoter 
(P^,), the yeast centromeric sequences (CEN), yeast 
autonomous replication sequences (ARS) and the TRP1 gene 
(llU'l) are indicated. 

FIG. 7 shows a Coomassic blue-stained SDS-PAGE gel 
showing the purification of Gst-Cre from E. coli cells 
containing pQL123. 

FIG. 8 provides a schematic showing the strategy 
employed for the in vitro recombination of a pUNI vector 
("pA," pUNI-5) with a pIIOST vector ("pB f w pQL103) to 
create a fused construct (**pAB**). The relevant markers on 
each construct are indicated as arc selected restriction 
enzyme sites. 

FIG. 9A provides a schematic showing the starting con- 
structs (pUNl-Skpl and pgsl-lox) and the predicted fusion 
construct (p<5st-Skpl) generated by an in vitro fusion reac- 
tion. 

FIG. 9B shows an clhichum bromide-stained gel is show- 
ing the separation of restriction fragments generated by the 
digestion of pUNI-Skpl, pgsl-lox and p<Jst-Skpl. 

FIG. 10A shows a Coomassic blue-stained SDS-PAGE 
gel showing the expression of the Gst-Skpl protein from E. 
coli cells containing p<isl-Skp1- 

FIG. 10B shows a Western blot of an SDS-PAGE gel 
containing extracts prepared from E. coli cells containing 
pGsl-Skpl which was probed using an anti-SkpI antibody. 

FIG. 11 shows a Western blot or an SDS-PAGE gel 
containing extracts prepared from E. coli cells (QIJB4) 
containing either a conventionally constructed Gst- Skpl 
plasm id or pGsl-Skpl (produced by an in vitro fusion 
reaction). 

FIG. 12 provides a schematic illustrating the in vivo gene 
trap method for the recombination of fox-containing vectors 
in a host cell constitutively expressing the Crc protein. 

FIG. 13 provides the nucleotide sequence of the wild-type 
loxP site (SEQ ID NO:12), the IoxP2 site (SEQ TD NO:13), 
the IoxP3 site (SEO ID NO: 14) and the IoxP23 site (SEQ ID 
NO:15). 

DEFINITIONS 

To facilitate understanding of the invention, a number of 
terms are defined below. 

As used herein, "a conditional origin of replication/' refers 
to an origin of replication that requires the presence of a 
functional trans-acting factor (e.g., a replication factor) in a 
prokaryotic host cell. Conditional origins of replication 
encompass tempera turc-sensitive rcplicons such as rep 
pSCl01*\ 

As used herein, the term "origin of replication** refers to 
an origin of replication that is functional in a broad range of 
prokaryotic host cells (i.e., a normal or non-<onditional 
origin of replication such as the ColHl origin and its 
derivatives). 
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*lric terms "sequence-specific rc«>rnbinasc" and "site- 
specific recombinase" refer to enzymes that recognize and 
bind to a short nucleic acid site or sequence and catalyze the 
recombination of nucleic acid in relation to these sites. 

•|hc terms "sequence-specific recombinasc target site** 5 
and "site-specific recombinasc target site" refer ti> short 
nucleic acid site or sequence which is recognized by a 
sequence- or site-specific recombinasc and which become 
the crossover regions during the site-specific recombination 
event, examples of sequence-specific recombinasc target 
sites include, but are not limited to, lox sites, frt sites, att 
sites and dif sites. 

The term "lox site" as used herein refers to a nucleotide 
sequence at which the product of the ere gene of bacterioph- 
age Pi, Cre recombinasc, can catalyze a site-specific recom- 
bination. A variety of lox sites are known to the art including 
the naturally occurring loxP (the sequence found in the PI 
genome), loxB, loxL and loxR (these are found in the E. coli 
chromosome) as well as a number of mutant or variant lox 
sites such as loxPSH, loxA86, loxA117, loxC2, loxP2, loxP3 
and loxP23. 

The term "frt site" as used herein refers to a nucleotide 
sequence at which the product of the FLP gene of the yeast 
2fun plasmid, FLP recombinasc, can catalyze a site-specific 
rccombina tion. 

The term "unique restriction enzyme site** indicates that 
the recognition sequence for a given restriction enzyme 
appears once within a nucleic acid molecule. For example, 
the EcoRI site is a unique restriction enzyme site within the 
plasmid pUNl-10 (SEQ il> NO:l). 

A restriction enzyme site is said to be located "adjacent to 
the 3' end of a sequence -specific recombinasc target site" if 
the restriction enzyme recognition site is located down- 
stream of the 3* end of the sequence-specific recombinasc 
target site. The adjacent restriction enzyme site may, but 
need not, be contiguous with the last or 3' nucleotide 
comprising the sequence-specific recomblnase target site. 
For example, the EcoRI site of pUNT-10 is located adjacent 
(within 3 nucleotides) to the 3' end of the loxP site (see FIG. 
2B); the Xhol, Ndel, Ncol sites arc also adjacent (ie., within 
about 10-150 nucleotides) to the ioxPsitc but these sites arc 
not contiguous with the 3* end of the loxP silo in pUNI-10. 

The terms "polylinker" or "multiple cloning site" refer to 
a cluster of restriction enTyme sites on a nucleic acid 
construct which arc utilized for the insertion and/or excision 
of nucleic acid sequences such as the coding region of a 
gene, lox sites> etc. 

The term "prokaryolic termination sequence" refers In a 
nucleic acid sequence which is recognized by the RNA 
polymerase of a prokaryotic host cell and results in the 
termination . of transcription. Prokaryotic termination 
sequences commonly comprise a GC-rich region that has a 
twofold symmetry followed by an AT-rich sequence [Stryer, 
supra]. A commonly used prokaryotic termination sequence 
is the T7 termination sequence. A variety of termination 
sequences are known to the art and may be employed in the 
nucleic acid constructs of the present invention including, 
the Tta. Tlb. T /u, T^cnnination signals 

derived from the bacteriophage lambda [Lambda H, Ilcndrix 
cl al. Eds., supra] and termination signals derived from 
bacterial genes such as the trp gene of £. coli [Stryer, supra]. 

The term "eukaryotic polya deny la tion sequence** (also 
referred to as a "poly A site** or "poly A sequence") as used 
herein denotes a DNA sequence which directs both the 
termination and polya denyla tion of the nascent RNA tran- 
script. Efficient poly a denyla tion of the recombinant Iran- 
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script is desirable as transcripts lacking a poly A tail arc 
unstable and arc rapidly degraded. 'I Tie poly A signal unlived 
in an expression vector may be "heterologous** or "endog- 
enous.** An endogenous poly A signal is one that is found 
naturally at the 3' cod of the coding region of a given gene 
in the genome. A heterologous poly A signal is one which is 
one which is isolated from one gene and placed 3' of another 
gene. A commonly used heterologous poly A signal is the 
SV40 poly A signal. The SV40 poly A signal is contained on 
( a 237 bp BamHT/BcTT restriction fragment and directs both 
termination and polyadenylaiion [J. Sam brook, supra, al 
16.6-16.7]; numerous vectors contain the SV40 poly A 
signal [e.g., pCEP4, pREP4, pEBVHis (Invitrogcn)]. 
Another commonly used heterologous poly A signal is 
I5 derived from the bovine growth hormone (BGII) gene; the 
BGI I poly A signal is available on a number of commercially 
available vectors [e.g., pcDNA3.1, pZcoSV2, pSccTag 
(Invitrogcn)]. The poly A signal from the Herpes simplex 
virus thymidine kinase (IISV tk) gene is also used as a poly 
A signal on expression vectors; vectors containing the HSV 
tk poly A signal include the pBK-CMV, pBK-RSV, 
pOP13CZAT vectors from Stratagene. 

As used herein* the terms "selectable marker** or "select- 
able marker gene** refers to the use of a gene which encodes 
25 an enzymatic activity that confers the ability to grow in 
medium lacking what would otherwise be an essential 
nutrient (e.g., the TKF1 gene in yeast cells); in addition, a 
selectable marker may confer resistance to an antibiotic or 
drug upon the cell in which the selectable marker is 
3(J expressed. A selectable marker may be used to comer a 
particular phenotype upon a host cell. When a host cell must 
express a selectable marker to grow in selective medium, the 
marker is said U> be a positive selectable marker (e.g., 
antibiotic resistance genes which confer the ability to grow 
35 in the presence of the appropriate antibiotic). Selectable 
markers can also be used to select against host cells con- 
taining a particular gene (e.g., the sacB gene which, if 
expressed, kills the bacterial host cells grown in medium 
containing 5% sucrose); selectable markers used in this 
4a manner are referred to as negative selectable markers or 
counter-selectable markers. 

As used herein, the term "vector" is used in reference to 
nucleic acid molecules that transfer DNA segments) from 
one cell to another. The term "vehicle" is sometimes used 
45 interchangeably with "vector." A "vector** is a type of 
"nucleic acid construct.** The term "nucleic acid construct" 
includes circular nucleic acid constructs such as plasmid 
constructs, plasmid constructs, cosmid vectors, etc. as well 
as linear nucleic acid constructs (e.g., X phage constructs, 
SO PCR products). The nucleic acid construct may comprise 
expression signals such as a promoter and/or an enhancer (in 
such a case it is referred to as an expression vector). 

The term "expression vector" as used herein refers to a 
recombinant DNA molecule containing a desired coding 
55 sequence and appropriate nucleic acid sequences necessary 
for the expression of the opcrably linked coding sequence in 
a particular host organism. Nucleic acid sequences neces- 
sary for expression in procaryoles usually include a 
promoter, an operator (optional), and a ribosomc binding 
en site, often along with other sequences. Eucaryotic cells are 
known to utilize promoters, enhancers, and termination and 
polyadenylaiion signals. 

The terms "in operable combination", "in operable order" 
and "opcrably linked" as. used herein refer to the linkage of 
65 nucleic acid sequences in such a manner that a nucleic acid 
molecule capable of directing the transcription of a given 
gene and/or the synthesis of a desired protein molecule is 
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produced. The Icrm also refers to the linkage of amino acid As used herein, ihc lerm "an oligonucleotide having a 
sequences in such a manner so thai a functional pro lei n is nucleotide sequence encoding a gene" means a nucleic acid 
produced. sequence comprising the coding region of a gene or in other 
The terms "transformation" and '"transfection** as used words the nucleic acid sequence which encodes a gene 
herein refer to the introduction or foreign UNA into prokary- 5 product The coding region may be present in either a 
otic or cucaryolic cells. 'Iransrormaiion of prokaryolic cells cDNA, genomic DNA or RNA form. When present in a 
may be accomplished by a variety of means known to the art DNA form, the oligonucleotide may he single-stranded (i.e., 
including the treatment of host cells with CaCLj to make the sense strand) or double-stranded. Suitable control elc- 
competent cells, clcctroporation, etc. Transfection of meats such as enhancers/promoters, splice junctions, poly- 
cukaryotic cells may be accomplished by a variety of means JQ adenylation signals, etc. may be placed in close proximity to 
known to the art including calcium phosphate -DNA the coding region of the gene if needed to permit proper 
co-precipi latum, IDEAE-dcxlran-mcdiatcd transfection, initiation of transcription and/or correct processing of the 
polybrene-mediatcd transfection, electroporation, primary UNA transcript Alternatively, the coding region 
microinjection, liposome fusion, lipofection, protoplast utilized in the vectors of the present invention may contain 
fusion, retroviral infection, and biolistics. 15 endogenous enhancers/promoters* splice junctions, inter- 
As used herein, the terms "restriction cndonuclcascs** and vening sequences, poly adeny lation signals, etc or a cumbi- 
"rcstriclion enzymes'* refer to bacterial enzymes, each of nation of both endogenous and exogenous control dements, 
which cut double-stranded DNA at or near a specific nuclc- As used herein, the term "regulatory element** refers to a 
olidc sequence. genetic element which controls some aspect of the expres- 
Astiscd herein, the term "recombinant ON Amolcculc" as 211 siori of nucleic acid sequences. For example, a promoter is 
used herein refers to a DNA molecule which is comprised of a regulatory element which facilitates the initiation of tran- 
segments of DNA joined together by means of molecular scrip tion of an operably linked coding region. Other regu- 
biological techniques. . la Lory elements are splicing signals, poly adeny la lio n signals, 

DNA molecules are said to have "5* ends" and "3' ends" termination signals, etc (defined infra), 

because mononucleotides are reacted to make oligooucle- 25 Transcriptional control signals in eukaryotes comprise 

o tides in a manner such that the 5' phosphate of one "promoter" and "enhancer" elements. Promoters and 

mononucleotide pentose ring is attached to the 3* oxygen of enhancers consist of short arrays of DNA sequences that 

its neighbor in one direction via a phosphodiestex linkage. interact specifically with cellular proteins involved in tran- 

Therefbre, an end of an oligonucleotides referred to as the scrip tion [Maniatis, T. et aL, Science 236:1237 (1987)]. 

"5* end" if its 5' phosphate is not linked to the 3* oxygen of 3U l*romotcr and enhancer elements have been isolated from a 

a mononucleotide pentose ring and as the "3* end" if its 3* variety of cucaryotic sources including genes in yeast, insect 

oxygen is not linked to a 5* phosphate of a subsequent and mammalian cells and viruses (analogous control 

mononucleotide pentose ring. As used herein, a nucleic acid elements, i.e., promoters, arc also found in procaryotes). 'I*he 

sequence, even if internal to a larger oligonucleotide, also selection of a particular promoter and enhancer depends on 

may he said to have 5* and -V ends. Tn either a linear or 35 what cell type is to be used to express the protein of in teresL 

circular DNA molecule, discrete elements are referred to as Some cucaryotic promoters and enhancers have a broad host 

being "upstream" or 5* of the "downstream" or 3' elements. range while others are functional in a limited subset of cell 

This terminology reflects the fact that transcription proceeds types [for review see "Vbss, S.D. et al., Trends Tiiochem. Sci, 

in a 5* to 3* fashion along the DNA strand. The promoter and 11:287 (1986) and Maniatis, T. et al., supra (1987)]. For 

enhancer elements which direct transcription of a linked an example, the SV40 early gene enhancer is very active in a 

gene are generally located 5' or upstream of the coding wide variety of cell types from many mammalian species 

region. However, enhancer elements can exert their effect and has been widely used for the expression of proteins in 

even when located 3' of the promoter element and the coding mammalian cells [Dijkema, R. et al., EMBO J. 4:761 (1985) 

region. Transcription tormina tion and poly adeny la tion sig- ]. Two otoer examples of piomoleryenhanccr elements active 

nals arc located 3' or downstream of the coding region. 45 in a broad range of mammalian cell types arc those from the 

The 3' end of a promoter element is said to be located human elongation factor 10 gene [Uetsuki,T. et al.,/. Biol 

upstream of the 5' end of a sequence-specific recombinase CVicm., 264:5791 (1989), Kim, 13. W. el al., (*ene 91:217 

target site when (moving in a 5* to 3* direction along the (1990) and Mizushima, S. and Nagala,S., Nuc. Acids. Res., 

nucleic acid molecule) the 3* terminus of a promoter element 18:5322 (1990)] and the long terminal repeats of the Rous 

(the transcription start site is taken as the 3' end of a so sarcoma virus [Gorman, C M. el al., Fmc. NalL Acad. Set 

promoter element) precedes the 5' end of the sequence- USA 79:6777 (1982)] and the human cytomegalovirus 

specific recombinase target site. 'Ihc 3' end of the promoter [Boshart, M. et al., CeU 41 :52l (1 985)]. 

element may be located adjacent (generally within about 0 As used herein, the term "promoter/enhancer" denotes a 

to 500 bp) to the 5' end of the sequence-specific recombinase segment of DNA which contains sequences capable of 

target site; such an arrangement is used when the pHOST 55 providing both promoter and enhancer functions (i.e., the 

vector is not intended to permit the expression of a trans- functions provided by a promoter element and an enhancer 

la tion al fusion with the gene of interest donated by a pUNI element, see above for a discussion of these functions). For 

vector. Alternatively when the pHOST vector is intended to example, the long terminal repeals of retroviruses contain 
permit the expression of a transl at ion al fusion, the 3' end of both promoter and enhancer functions. The enhancer/ 

the promoter element is located upstream of both the 611 promoter may be "endogenous** or "exogenous** or "heler- 

scquences encoding the ammo-terminus of a fusion protein ologous." An "endogenous'* enhancer/promoter is one which 

and the 5' end of the sequence-specific recombinase target is naturally linked with a given gene In the genome. An 

site; in this case, the 5* end of the sequence -specific recom- "exogenous** or "heterologous** enhancer/promoter is one 

binase target site is located within the coding region of the which is placed in juxtaposition to a gene by means of 
fusion protein (e.g., located downstream of both the pro- <S5 genetic manipulation (i.e., molecular biological techniques) 
motor clement and the sequences encoding the affinity such that transcription of that gene is directed by the linked 

domain, such as Gst). enhancer/promoter. 
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The presence of "splicing signals" on an expression 
vcclor often results in higher levels of expression or the 
recombinant transcript Splicing signals mediate the removal 
of introns from the primary KN A transcript and consist of a 
splice donor and acceptor site [Sambrook, J. et al. t Molecu- 
lar Cloning: A Laboratory Manual, 2nd ed., Cold Spring 
Harbor laboratory Press, New York (1989) pp. 16.7-16.8]. 
A commonly used splice donor and acceptor site is the splice 
junction from the 16S UNA uf S V40. 

Efficient expression of recombinant DNA sequences in 
eucaryotic cells requires expression of signals directing the 
efficient termination and polyadenylatioo of the resulting 
transcript. Transcription termination signals arc generally 
found downstream of the poly ade aviation signal and are a 
few hundred nucleotides in length. *l"hc term "poly A Kile** or 
"poly A sequence" as used herein denotes a DNA sequence 
which directs both the termination and poly ade ny la lion ol 
Ihe nascent RN A transcript. Efficient polyadcnylalion or the 
recombinant transcript is desirable as transcripts lacking a 
poly A tail are unstable and are rapidly degraded. The poly 
A signal utilized in an expression vector may be "heterolo- 
gous" or "endogenous. 9 * An endogenous poly A signal is one 
that is found naturally at the 3* end of the coding region of 
a given gene in the genome. A heterologous poly A signal is 
one which is one which is isolated from one gene and placed 
3' of another gene. 

Eucaryotic expression vectors may also contain "viral 
repticons" or "viral origins or replication.*" Viral repticons 
are viral DNA sequences which allow Tor the cxlrachromo- 
somal replication of a vector in a host cell expressing the 
appropriate replication factors. Vectors which contain either 
the SV40 or polyoma virus origin of replication replicate to 
high copy number (up to 10* copies/cell) in cells that express 
the appropriate viral T antigen. Vectors which contain the 
replicons from bovine papillomavirus or Bpstein-Barr virus 
replicate extrachromosomally at low copy number (—100 
copies/cell). 

As used herein, the terms "nucleic acid molecule 
encoding," "DNA sequence encoding," and "DNA encod- 
ing** refer to the order or sequence of deoxyribonucleo tides 
along a strand of deoxyribonucleic acid. The order of these 
deoxy ribonucleotides determines the order of amino acids 
along the polypeptide (protein) chain. The DNA sequence 
thus codes fur the amino acid sequence. 

As used herein, the term "gene" means the deoxy ribo- 
nucleotide sequences comprising the coding region of a 
structural gene and the including sequences located adjacent 
to the coding region on both the 5' and 3' ends for a distance 
of about 1 kb on cither end such that the gene corresponds 
to the length of the full-length mRNA. The sequences which 
are located 5' of the coding region and which arc present on 
the mRNA arc referred to as 5* non-translated sequences. 
The sequences which are located 3* or downstream of the 
coding region and which are present on the mWNA are 
referred to as 3' non-translated sequences; these sequences. 
The term "gene** encompasses both cDNA and genomic 
forms of a gene. A genomic form or clone of a gene contains 
the coding region interrupted wUh non-coding sequences 
termed "introns" or "intervening regions" or "intervening 
sequences.* 1 Tn Irons are segments of a gene which are 
transcribed into nuclear RNA(hnRNA); introns may contain 
regulatory elements such as enhancers. Introns are removed 
or "spliced out" from the nuclear or primary transcript; 
introns therefore are absent in the messenger RNA(mRNA) 
transcript. The mRNA functions during translation to 
specify the sequence or order of amino acids in a nascent 
polypeptide. 
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In addition to containing introns, genomic forms or a gene 
may also include sequences located on both the 5* and 3* end 
of the sequences which are present on the RNA transcript. 
'Ihe.se sequences are referred to as "flanking" sequences or 

5 regions (these flanking sequences axe located 5' or 3* to the 
non-translated sequences present on the mRNA transcript). 
The 5* flanking region may contain regulatory sequences 
such as promoters and enhances which control or influence 
the transcription of the gene, 'ihc 3* flanking region may 

10 contain sequences which direct the termination of 
transcription, post-transcripuonal cleavage and polyadeny- 
lation. 

As used herein, the term "purified" or "to purify*' refers to 
the removal of contaminants from a sample. For example, 
1 5 recombinant Cre polypeptides are expressed in bacterial host 
cells (as a Gst-Cre fusion protein) and the Ore polypeptides 
are purified by the removal of host cell proteins; the percent 
of recombinant Ore polypeptides is thereby increased in the 
sample. 

20 The term "recombinant DNA molecule*" as used herein 
refers to a DNA molecule which is comprised of segments 
of DNA joined together by means of molecular biological 
techniques. 

The term "recombinant protein** or "recombinant 

25 polypeptide** as used herein refers to a protein molecule 
which is expressed from a recombinant DNA molecule. 

The term "native protein'* as used herein to indicate that 
a protein does not contain amino acid residues encoded by 

3(J vector sequences; that is the native protein contains only 
those arnino acids found in the protein as it occurs in nature. 
A native protein may be produced by recombinant means or 
may be isolated from a naturally occurring source. 

As used herein the term "portion" when in reference to a 

35 protein (as in "a portion of a given protein**) refers to 
fragments of that protein. The fragments may range in size 
from four amino acid residues to the entire amino acid 
sequence minus one amino acid. 

As used herein, the term "fusion protein" refers to a 

4n chimeric protein containing the protein of interest (i.e., the 
Cre protein) joined to an exogenous protein fragment (the 
fusion partner which consists of no n- Cre protein sequences). 
The fusion partner may enhance solubility of the protein of 
interest as expressed in a host cell, may provide an affinity 

45 tag to allow purification of the recombinant fusion protein 
from the host cell or culture supernatant, or both. If desired, 
the fusion protein may be removed from the protein of 
interest (ie. F the Cre protein) by a variety of enzymatic or 
chemical means known to the art. 

50 DESCRIPTION OK THE INVENTION 

'ihc present invention provides reagents and methods 
which comprise a system for the rapid subcloning of nucleic 
acid sequences in vivo and in vitro without the need to use 
55 restriction enzymes. This system is referred to as the 
Uni vector Fusion System. The basis of the Uni vector Fusion 
System is u vector termed the Uni vector or the pUNI vector 
into which sequences encoding a gene of interest (cl)N A or 
genomic) are inserted. The pUNT vector has a sequence- 
fid specific recombinuse target site, such us a loxP site, preced- 
ing the insertion site for the gene of interest, a selectable 
marker gene (this feature is optional) and a conditional 
origin of replication that is active only in host cells express- 
ing the requisite trans-acting replication factor. The pUNI 
6$ vectors are designed to contain a gene of interest but lack a 
promoter for the expression of the gene of interest. Ihc gene 
of interest may be cloned directly into the pUNI vector (i e., 
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Ihc pUNI vector may be used as a cloning vector, particu- 
larly for the cloning of cDNA libraries) or a previously 
cloned gene of interest may be inserted (i.e., suhctoned) into 
the pUNI vector. 

Using a sequence -specific recombinasc (e.g., Crc 5 
recombinasc), a precise fusion of the pUNI vector into a 
second vector containing another copy of the sequence- 
specific recombinase target site found on the pUNT vector is 
catalyzed. *l*he second vector, referred to generically as a 
pIIOST vector, is an expression vector that contains the 10 
sequence-specific recombinase target site downstream of the 
promoter element contained within the expression vector. 
Following the site-specific recombination event which 
occurs between the single sequence-specific recombinase 
target sites located on each vector (e.g., the pUNI vector and is 
the pIIOST vector), the two vectors are stably fused in a 
manner thai places the expression of the gene of interest 
under the control of the promoter clement contained within 
the expression vector. 'Phis fusion event also occurs in a 
manner that retains the proper lranslalional reading frame of 2U 
the gene of interest. This subcloning event occurs without 
the need to use restriction enzymes. 

'ihc fusion or recombination event can be selected for by 
selecting for the ability of host cells, which do not express 
the trans-acting replication factor required for replication of 25 
the conditional origin contained on the pUNI vector, to 
acquire the selectable p he no type conferred by the selectable 
marker gene (if present) on the pUNI vector. The pUNT 
vector cannot replicate in cells that do not express the 
trans-acting replication factor and therefore, unless the 3U 
pUNT vector has integrated into the second vector that 
contains a non-conditional origin of replication, pUNI will 
be lost from the host cell. 

The Uoivector Fusion System allows any number of ^ 
expression constructs containing the gene of interest present 
on the pUNT vector to be made rapidly (e.g., within a single 
day). Using conventional cloning or subcloning techniques 
which employ restriction enzyme digestions) v the produc- 
tion of a single expression vector containing a gene of ^ 
interest can take several days (for the design and construc- 
tion of each expression vector). In contrast, once a battery of 
expression vectors modified to contain the appropriate 
sequence-specific recombinasc target site is made, a gene of 
interest can be transferred to any number of expression 
vectors in an afternoon using the Univcctor fusion SysLcm. 

FIG. 1 provides a schematic illustrating certain elements 
of the pUNI vectors and the Univcctor Fusion System. 

a) Conditional Origins of Replication and Suitable Host 
Cells SO 

Conditional origins of replication are origins which 
require the presence or expression of a trans-acting factor in 
the host cell for replication. A variety of conditional origins 
of replication functional in prokaryotic hosts (e.g., E. colt) 
are known to the art. The present invention is illustrated but 55 
not limited by the use of the R6Ky origin, oriR, from the 
piasmid R6K. The R6Ky origin requires a trans-acting 
factor, the II protein supplied by the pir gene [Me tea If el aL 
(1996) Plasmid 35:1} E. coli strains containing the pir gene 
will support replication of R6K7 origins to medium copy 6<i 
number. A strain containing a mutant allele of pir, pir-116, 
. will allow an even higher copy number of constructs con- 
taining the R6Ky origin. 

E. coli strains that express the pir or pir- 116 gene product 
include DW18815 (AXCC 47079; this strain contains the 65 
pir-116 gene), BW 19094 (ATCC 47080; this strain contains 
the pri gene), BW20978 (this strain contains the pir-116 
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gene), BW20979 (this strain contains the pir gene), 
BW21037 (this strain contains the pir-116 gene) and 
BW21038 (this strain contains the pir gene) (Met calf et al, 
supra). 

Other conditional origins of replication suitable for use on 
the pUNT vectors of the present invention include: 

1) the RK2 oriV from the plasmid RK2 (ATCX: 37125). 
The RK2 oriV requires a trajis-acting protein encoded by the 
trfA gene [Ayres et al. (1993) J. Mol. Biol. 230:174]; 

2) the bacteriophage PI ori which requires the repA 
protein for replication [l J a1 cl al. (1986) J. Mol. Biol. 
192:275]; 

3) the origin of replication of the plasmid pSCl 01 (ATCC 
37032) which requires a plasmid encoded protein, rep A, Tor 
replication [Sugiura et aL (1992) J. BacterioL 175: 5993]. 
The pSClOl ori also requires IHF, an E. caU protein. /?. coli 
strains carrying the him A and himD (hip) mutants (the him 
and hip genes encode sub units of 1 1 IF) cannot support 
pSClOl replication [Stcnzcl ct aL (1987) Cell 49:709j 

4) the bacteriophage lambda ori which requires the 
lambda O and P proteins [T^ambda IT, Hendrix et al. Eds., 
Cold Spring Harbor Press, Cold Spring Ilarbor, N.Y. 
(1983)]; 

5) pB 11322 and other 0>1B1 derivatives will not replicate 
in polA mutants of E. coli and therefore, these origins of 
replication and be used in a conditional manner [Grindley 
and Kelley (1976) Mol Cen. Genet. 143:311]; and 

6) replication-thermosensitive plasm ids such pSU739 or 
pSU300 which contain a thermosensilive replicon derived 
from plasmid pSClOl, rep pSClOl" which comprises oriV 
[Mcndiola and de ta Cm* (1989) Mol. 

MicrobioL 3:979 and Irancia and Lobo (1996) J. Bad. 
178:894]. pSU739 and pSU3O0 are stably maintained in E. 
coli strain l)H5<i (Gibco BR1-) al a growth temperature of 
30° (X (42° C is non-permissive for replication of this 
replicon). 

Other conditional origins of replication, including other 
temperature sensitive rcplicons, are known to the art and 
may be employed in the vectors and methods of the present 
invention. 

b) Sequence-Specific Rccombinascs And Target Recog- 
nition Sites 

The precise fusion between the pUNI vector and the 
expression vector is catalyzed by a site -specific recombi- 
nasc. Site-specific rccombinascs are enzymes that recognize 
a specific DNA site or sequence (referred to herein geneti- 
cally as a "sequence-specific recombinase target site") and 
catalyzes the recombination of DNA in relation to these 
sites. Site -specific recombinases are employed for the 
recombination of ON A in both prokaryolcs and cukaryotcs. 
Examples of site-specific recombination include 1) chromo- 
somal rearrangements which occur in Salmonella typhi/mi- 
lium during phase variation, inversion of the FT J* sequence 
during the replication of the yeast 2 /un circle and in the 
rearrangement of immunoglobulin and T cell receptor genes 
in vertebrates, 2) integration of bacteriophages into the 
chromosome of prokaryotic host cells to form a lysogen and 
3) transposition of mobile genetic elements (e.g., 
transposons) in both prokaryolcs and cukaryotcs. 'Ihc term 
"site-specific recombinase" refers to enzymes that recognize 
short DNA sequences that become the crossover regions 
during the recombination event and includes rccombinascs, 
transposases and in le g rn.se s. 

'Ihc present invention is illustrated but not limited by the 
use of vectors containing loxP sites and the recombination of 
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these vcclors using the <:rc rccornbina.se of bacteriophage 11. between two lox sites having the same sequence (ihc effi- 

'ITtc Cso protein catalyses rccombi nation of l>N A between ciency will depend on the degree and the location of the 

two loxP sites [Sternberg ct al (1981) Cold Spring Harbor variations in the binding sites). For example, the loxC2 site 

Symp. Quant. Biol. 45:297]. The loxl 1 sites may be present can be efficiently recombined with the loxP site; these two 

on the same DNA molecule or they may be present on s i ox sites differ by a single nucleotide in the lefi binding site. 

of.do^-str^cdMbps^ueoceCSEOlDNOa^w^b "ffSS^SSu^^ 

comprises two 13 bp inverted repeat sequences separated by intrude- kwuiuiu««> 

an 8 bp spacer region [Hocssetat. (1982) Proc. Natl. Acad. 1Q ^™f' ^ im ' . . , - «. t 

ScL USA 79:3398 and U-S. Pat. No. 44)59317, the disclo- lhc H J .' rc^>mbina^or the 2p. plasmid oVSacdui^ 

sure of which is herein incorporated by reference]. The ?Z^™ rev £T [Cox 9 9S ? Nat i L . i > c ^ ^ V ^ 

internal spacer sequence of the loxPsitc is asymmetrical and 80:4223] which recognize the ftl site which, like the loxP 

thus, two loxP sites can exhibit directionality relative to one f\™ m ^*™ v ^ 

another [Hoess ct al. (1984) Proc. Natl. Acad. Sci. USA 1S * *>P ?**%L^ ^jS^^HSFI^^^^^^^^ 

81:1026]: When two loxl* sites on the same DNA molecule " ^S^ 0 ^: 3 ^ 12 ? ^ N ? :18 * " adc 5^ 

are in a directly repealed orientation, Crc excises Ihc DNA The FLP gene has been clorjed and expressed in E, cofc(Cbx, 

between these two sites leaving a single loxl' site on the m ^^f^^^^J^T^ 

DNAmoleculc[Abremskietal. (1983) <^11 32:1301]. If two ^^cation ^nUS9Z^8M, ftbhciton No.: WO 

loxP sites are in opposite orientation on a single DNA ^ W15694, the disclosure of which is herein irx^rporated by 

molecule, Cre inverts the DNA sequence between these two reference) and has been purified 1 [Mcyer-t *an ct al. (1 987) 

sites rather than removing the sequence. Two circular DNA ^clcic Acids Rc^. 15:6469; Babmcau el al (1985) J Biol 

molecules each wntaining a single loxP site will recombine ^l^^"^ Sadowskl < 1 9a5 > J * 

with another to form a mixture of monomer, dimer, trimer, Hwl - <:hcTn - 260:12328]; 

etc. circles. The concentration of the DNA circles in the „ 2) the Int rccombinasc of bacteriophage lambda (with or 

reaction can be used to favor the formation of monomer without Xis) which recognizes alt sites (Wcisberg el al. In: 

(lower concentration) or multimcrio circles (higher Lambda U, supra, pp. 211-250); 

concentration). 3) the xerC and xerD rccombinasc s of E. coli which 

Circular DNA molecules having a single loxl* site will together form a rccombinasc that recognizes the 28 bp dif 

recombine with a linear molecule having a single loxP site 3li site [Leslie and Sherratt (1995) EMBO J. 14:1561]; 

to produce a larger linear molecule. Ore interacts with a 4) the Tnt protein from the conjugative transpose n Tn9 16 

linear molecule containing two directly repeating loxP sites [Lu and Churchward (1994) EMBO J. 13:1541]; 

to produce a circle containing the sequences between the 5) Tpnl and the 0- lactamase transpose rus Qjevesque 

loxl* sites and a single loxl* site and a linear molecule (1990) J. Kacleriol. 172^3745J 

<»ntaining a single loxP site at the site of the deletion. 3S ^ ^ Tn3 resoivage [Flanagan ct aL (1989) J. Mol. Biol. 

'Die <>c protein has been purified lo homogeneity 206:295 and Stark ct al. (1989) Cell 58:779]; 

[Abremski et al. (1984) J. Mol. Biol. 259:1509] and the crc 7)meSporv-Cre^mbmascofB^i/ZttS5ufcriits[SatoetaL 

gene has been cloned and expressed in a variety of host cells j K acu .riol 172:1092]- 

[Abremski et aL (1983). ^pra]. Purified Ore protein is " ^ ^^'^ [Galsgow et aL (1989) Diol . 

available from a Dumber of suppliers (e.g., Novagen and 40 264-10072T 

"^^^C^X* a number „r variant or - ->- O«0 

mutant lox sites (variant relative to Ihc loxl* sequence), 73991J, and 

including the loxB, lox! 1 and InxR sites which are found in ™) lhtJ immunoglobulin recombmascs [Malynn el al. <xll 

the E. coli chromosome [Hoess et al. (1982), supra]. Other 45 (198*0 S4:453]. 

variant lox sites include loxPSII [S'-ATAACTTCGTATA <0 Modification of Expression Vectors 
GTAXACATT ATACG AAGTEAT-3' (SEQ ID NO: 16); As discussed above, pUNI vectors arc used to transfer a 
spacer region underlined; Hoess ct al. (1986), supra], loxC2 gene of interest into a suitably modified expression vector 
[5'-ACAAC TTCGTATA via site-specific recombination. The modified expression 
ATGTATG CTAIACGAAGTTAT-3 t (SCO ID NO:17); so vectors or host vectors used in the Univector Fusion System 
spacer region underlined; U.S. Pat. No. 4,959,317). Cre are referred to as pIIOST vectors. pIIOST vectors are 
catalyzes the cleavage of the lox site within the spacer region generally expression vectors (e.g., plasmids) which have 
and creates a six base-pair staggered cut [Hoess and Abrcm- been modified by the insertion of a sequence-specific reco ru- 
sk i (19K5) J. Mol. Kiol. 181*351]. The Iwo 13 bp inverted binasc target trite (e.g., a lox Kite). The presence of the 
repeal domains of the lox site represent binding sites for the 55 sequence-specific rccombinasc target site on Ihc pHQST 
Cre protein. If Iwo lox sites differ in their spacer regions in plasmid permits the rapid suhcloning or insertion of the gene 
such a manner that the overhanging ends of ihc cleaved interest contained within a pUNI vector lo generate an 
DNA cannot rc anneal with one another, (>c cannot cut- expression vector capable of expressing Ihc gene of in tcresL 
ciently catalyze a recombination event using the two differ- The pHOST vector may encode a protein domain such as an 
enl lox sites. For example, it has been reported that Cre 60 affinity domain including, but not limited to, glulathione-S- 
cannot recombine (at least not efficiently) a loxP site and a transferase (Gst), maltose binding protein (MBF), a portion 
loxPSII site; these two lox sites differ in the spacer region. of staphylococcal protein A (SPA), a polyhistidine Iract, etc. 
Two lox sites which differ due to variations in the binding A variety of commercially available expression vectors 
sites (ie., the 13 bp inverted repeats) may be recombined by encoding such affinity domains arc known to the art. The 
Cre provided that Cre can bind to each of the variant binding 65 affinity domain may be located at either the amino- or 
sites; the efficiency of the reaction between two different lox carboxy-tcraainus of tbc fusion protein. When the pHOST 
sites (varying in the binding sites) may be less efficient that plasmid contains a vector-encoded affinity domain, a fusion 
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protein comprising the vcclor-cn ct kIc d affinity domain and 
the protein of interest is; generated when the pUNI and 
pHOST vectors are re combined. 

*lh generate expression vccLors intended to generate tran- 
scriptional fusions (i.e., pHOST does not contain a vector- 
encoded protein domain), a sequence-spec i fie rccombinase 
target site is placed after (i.e., downstream of) the start of 
transcription in tbe host vector; this is easily accomplished 
using synthetic oligonucleotides comprising the desired 
scqucace-specific rccombinase target site. In designing the 
oligonucleotide comprising the sequence-specific rccombi- 
nase target silo, care is taken to avoid introducing an AiXi or 
start codon that might initiate translation inappropriately. 

To generate expression vectors intended to generate a 
fusion protein between a vector-encoded protein domain 
located at the amino -terminus of the fusion protein and the 
protein of interest (encoded by the gene of interest contained 
within the pUNI vector) (i.e., a translalional fusion), care is 
taken lo place the sequence-specific rccombinase target site 
in the correct reading frame such that 1) an open reading 
frame is maintained through the sequence-specific rccom- 
binase target site on pHOST and 2) the reading frame in the 
sequence-specific recombinase target site on pHOST is in 
frame with the reading frame found on the sequence-specific 
recombinase target site contained within the pUNI vector. In 
addition* the oligonucleotide comprising the scqueoce- 
speciiic recombinase target site on pIIOST is designed to 
avoid the introduction of in-frame stop codons. The gene of 
interest contained within the pUNI vector is cloned in a 
particular reading frame so as to facilitate the creation of the 
desired fusion protein. 

The modification of several expression vectors is pro- 
vided in the examples below to illustrate the creation of 
suitable pIIOST vectors. The general strategy involves the 
generation of a linker containing the desired scqucucc- 
speciiic recombinase target site (e.g., a lox site) by annealing 
two complementary oligonucleotides- *l>ie annealed oligo- 
nucleotides Conn a linker having sticky ends which arc 
compatible with ends generated by restriction enzymes 
whose sites are conveniently located in the parental expres- 
sion vector (e.g., within the polylinkcr of the parental 
expression vector). 

d) In Vitro Kecombinalion 

The fusion of a pUNI vector and a pIIOST vector may be 
accomplished in vitro using a purified preparation of a 
site-specific recombinase (e.g., Ore recombinase). The pUNI 
vector and the pIIOST vector are placed in reaction vessel 
(e.g., ajnicroccnlrifugc tube) in a buflcr compatible with the 
site -specific rccombinase to be used. For example, when a 
Ore recombinase (native or a fusion protein form) is 
employed, the reaction buffer may comprise 51) mM Tris- 
IIC1 (pll 7.5), 10 mM MgCLj, 30 mM NaCl and 1 mg/ml 
BSA. When a Fl-P recombinase is employed, the reaction 
buffer may comprise 50 mM Tris-HO (pH 7.4), 10 mM 
MgC^, 100/<g/ml BSA[Cronostajski and Sadowski (1985) 
260:12320]. The concentration of the pUNI vector and the 
pIIOST vector may vary between 100 ng to 1.0 /ig of each 
vector per 20 /d reaction volume with about 0.1 /^g of each 
nucleic acid construct (0.2 /jg total) per 20 /d reaction being 
preferred. The concentration of the site-specific rccombinase 
may be titcrcd under a standard set of reaction conditions to 
find the optimal concentration of cn/.yme lo be used as 
described in Hx. 4. 

Following the in Vitro fusion reaction, a portion of the 
reaction mixture is used to transform a suitable host cell to 
permit the recovery and propagation of tbe fused vectors. 



'fhc host cell employed will not express the trans-acting 
factor required for replication of the conditional origin of 
replication contained within the pUNI vector (or alterna- 
tively the host cell will be grown at a temperature which is 

s non-permissive for replication of a temperature sensitive 
replicon contained within the pUNI vector). The host cells 
will be grown under conditions which select for the presence 
of the selectable marker contained within the pUNI vector 
(e.g., growth in the presence of kanamycin when the pUNI 

10 vector contains the kanamycin resistance gene). Plasmld or 
noQ-chromosotnal UNA is isolated from host cells which 
display the desired phcootypc and subjected to restriction 
enzyme digestion to confirm thai the desired fusion event 
has occurred. 

15 e) Recombination in Procaryotic Host Cells 

'I Tic fusion of a pUNI vector and a pHOST vector may be 
accomplished in vivo losing a host cell that expresses the 
appropriate site-specific rccombinase (e.g., Crc 
recombinase). The host celt employed will lack the ability to 

2U express the trans-acting factor required for replication of the 
conditional origin of replication contained within the pUNI 
vector (or alternatively the host cell will be grown at a 
temperature which is non-pcrmissivc for replication of a 
temperature sensitive replicon contained within the pUNI 

25 vector). 

The pUNT vector and the pHOST vector are co trans- 
formed into the host cell using a variety of methods known 
to the art (e.g., transformation of cells made competent by 
treatment with GaCI 2 , electroporatton, etc.). The cotrans- 
30 formed host cells are grown under conditions which select 
for the presence of the selectable marker contained within 
the pUNI vector (e.g., growth in the presence of kanamycin 
when the pUNI vector contains the kanamycin resistance 
gene). Plasmid or non-chromosomal DNA is isolated from 
35 host cells which display the desired pbenotypc and subjected 
to restriction enzyme digestion to confirm that the desired 
fusion event has occurred. 

In addition to permitting the rapid transfer of a gene of 
interest from a particular pUNI vector containing a gene of 
40 interest into a pIIOST vector, the Univector Fusion System 
permits tbe rapid exchange of an entire cDNA library to a 
variety of expression vectors. The cDNA library is generated 
using a pUNI vector as the cloning vector (a pUNI library). 
The entire library may then be transferred (using cither an in 
4S vitro or an in vivo recombination reaction) into any expres- 
sion vector modified to contain a sequence-specific rccom- 
binase target site (e.g., a lox site) (i.e^ into a pHOST vector). 
'Ill is solves an existing problem in the art, in. that there is no 
way, using existing vector systems, lo exchange the inserts 
in a library made in one expression vector en masse (i.e., as 
an entire library) to a different expression vector. In addition, 
the sequences contained within a pUNI library can be used 
to re combine with linear X constructs (which can then be 
used to isolate specific genes by complementation of an 
appropriate host cells such as£. coU or 5. cerevisiae mutant 
cells). Further as described in Example 8, the in vivo gene 
trap method, a variation of the Univector Fusion System, can 
be used to transfer linear DNA fragments that lack a select- 
able marker such as a I'CR product into a variety of 
611 expression vectors. 

EXI'KIUMKNTAJL 

The following examples serve to illustrate certain pre- 
ferred embodiments and aspects oT the present invention and 
65 are not to be construed as limiting the scope thereof 

In the experimental disclosure which follows, the follow- 
ing abbreviations apply: °C. (degrees Centigrade); g 
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(gravitational field); vol (volume); DNA (deoxyribonucleic Nucleotides 1— 400 of pUNI-10 contain the conditional 

acid); RNA (rilxinuclcic acid); kdal or kD (kilodaltnns); OD origin of replication from RoTCv (OriR /<0JCv ); the OriRj^^ 

(optical density); EDTA (ethylene diamine tetra -acetic acid); was derived from the plasm id R6K (ATCC 371 20) [Me teal f 

E. cw/i (JCschcrichia coli); SOS (sodium dodecyl sulfate); el al. (1996) Plasmid 35:1 nucle<»tides 401-414 comprise 

PAGE (polyacryLamide gel electrophoresis); p (plasmid); s a Notl-Kpnl polylinker that facilitates the exchange of lox 

T.B (Tjuria-Bertani medium: per liter: 10 g Bacto-tryptone, 5 sites; pUNT-10 contains a wild-type loxP site (as discussed 

g yeast extract, 10 g NaCl, pll to 7.5 with NaOII); ml above, pUNI vectors containing modified lox sites may be 

(milliHter);/U(micronter); M employed). Nucleotides 415-446 comprise the wild-type 

(micro Molar); g (gram); /^g (microgram); ng (nanogram); U loxP site; nucleotides. 449-527 comprise a polylinkcr used 

(units), mU (milliunits); min. (minutes); sec. (sccoads); % to for the insertion of the gene o£ interest (genomic or cDNA 

(percent); bp (base -pair); kb (kilobase); PCR (polymerase sequences). Nucleotides 528-750 contain the polyAaddition 

chain reaction); Mris (txis(hydruxymelhyl)-aaxinomcuianc); sequence from bovine growth hoimone (BGH) [the UGH 

PMSF (pheoylmclhybiulfonylfluoridc); USA (bovine serum polyA sequence is available on a number of commercially 

albumin); IPTXi (isopmpyl-fl-l)-lhiogalactoside); ATXX* available vectors including pcDNA3.1 (Invilrngen)]; the 

(American 'type C ill lure C Collection, Rockvillc, McL); Hio- is BGH poly A sequence is provides a 3* end for transcripts 

Rad (Rio-Rad Corp., Hercules, Calif.); Invitrogen expressed in mammalian and other eukaryotic celts. The art 

(Tnvitrogen, Corp., San Diego, Calif.); New England is aware of other eukaryotic polyA sequences which may be 

N ad ear/1 )u Pont (Boston, Mass.); Nov age n (Novagen, Inc., used in place of the BGH polyA sequence (e.g., the SV40 

Madison, Wis.); Pharmacia or Pharmacia Biotech poly A sequence, the TK polyA sequence, etc.). Nucleotides 

(Pharmacia Biotech, Piscataway, NJ.); Pharmingen 2U 751-890 contain the T7 terminator sequence which is used 

(PharMingen, San Diegi, Calif.); Gibco BRL (Gaithcrsburg, to terminate transcription in prokaryotic hosts (numerous 

Md.); and Stratagene (Stratagene Cloning Systems, La Jolla, prokaryotic termination signals axe known to the art and may 

Calif.). be employed in place of the T7 terminator sequence). 

Nucleotides 890-895 comprise an EcoRV restriction 

EXAMPLE A 25 enzyme recognition site and nucleotides 896-2220 comprise 

Construction Of Univcclor Constructs me kanamyein resistance gene (Kan or Kn<8>) from Tn5 

which provides a positive selectable marker, i Tic Kxi gene 

In this example, an illustrative Univcclor construct is r ound OT1 pUNI-10 was modified using site-directed 

provided, the pUNl-lO vector, which contains a loxl' site, a mutagenesis u> remove the naturally occurring Ncol site 

kanamyein resistance gene (Kn®) and the R6Kv conditional ^ sucn mat pTJMi_iQ contains a unique Ncol site in the 

origin or replication ((WR^^-lT^OriRR^ is functional polylinker region located at nucleotides 449-527. pUNT 

only in E. coli strains expressing the TL replication protein vectors need not contain a Kn<£> gene (mcxlified or wild- 

(i.e., the product of the pir gene). A gene of interest ts placed type); other selectable genes may be used in place of the 

within pUNI-10 (either as a result of constructing a library Kn® gene (e.g., ampicillin resistance gene, tetracycline 

in pUNI-10 or by subcloning a previously cloned gene of 35 resistance gene, zeocin™ resistance gene, etc.). The pUNT 

interest). Once the gene of interest is contained within vector need not contain a selectable marker, although the use 

pUNI-10, any number of plasmid expression constructs of a selectable marker is preferred. When a selectable 

containing this gene of interest can be constructed rapidly marker is present on the pUNI vector, this marker is prcf- 

(e.g., within a single day). The expression constructs will crably a different selectable marker than that present on the 

contain an antibiotic resistance gene other than kanamyein 40 n HOSX vector. 

(eg., arnpieillin). Using the site-specific recombinase, Crc ^ flUclcoudc 5^^^ of pUNI-10 is provided in SCO 

a precise fusion between the pU N I vector and any other loxl* j^q. 
silc-conUining vector comprising the desired expression 

signals adjacent to the loxP site is calalyvixl The site- EXAMPLE 2 

specific recombination event which occuis between the 4S _ . .„ „ 

singleloxP sites located on each plasmid (e.g., pUNI and the * Construction or Host Hasmids H>r Use In Iric 
expression vector) results in the stable fusion of these two Unrvector Plasmid-Fusion System 
plasmids in such a manner as to place the expression of the Host plasmids used in the Univcclor plasmid fusion 
gene of .interest under the control of the expression signals system are referred to as pHOST plasmids. pHOST plasmids 
contained within the expression vector. This subcloning so or vectors are generally expression vectors which have been 
event occurs without the need to use restriction enzymes. modified by. the insertion of a lox site. The presence of the 
The fusion of pUNI-10 and the expression vector is selected lox site on the pHOST plasm Id permits the rapid subcloning 
for by selecting Cor the ability or E. coli cells thai do not or insertion of the gene interest contained within a pUNT 
express the 11 protein to grow in the presence of kanamyein. vector to generate an. expression vector capable of exprcss- 
pUNI cannot replicate in /','. coli cells that do not express the 55 fog th e gene of interest. The pHOST vector may encode a 
II protein unless pUNI has fused or integrated into another protein domain such as an affinity domain including, but not 
plasmid that contains a normal (ic., not a conditional) origin limited to, glutathione-S-transferase (Gst), maltose binding 
of replication (e.g., the Col El origin); in this case, pUNI will protein (MBP), a portion of sUphylococcal prolein A (SPA), 
be replicated (as part of the fusion plasmid) and kanamyein a polybistidinc tract, etc A variety of commercially avail- 
resistance will be conferred on the host cclL 6(l able expression vectors encoding such affinity domains arc 
a) Generation ofpUNMO known to the art. When the pHOST plasmid contains a 
FIG. 2A provides a schematic map of the pUNI-10 vector; vector-encoded affinity domain, a fusion prolein comprising 
the locations of selected restriction enzyme sites arc indi- the vector-encoded affinity domain and the protein of inter- 
catcd (with the exception of NotI, all sites shown are est is generated when the pUNI and pHOST vectors are 
unique). FIG. 2B shows the DNA sequence of the loxP site «5 re combined. 

and the poly linkers contained within pUNI-10 (i.e., nucle- To generate expression vectors intended to generate tran- 

otides 401-530 of SCO ED NO:l). scrip tional fusions (Le., pHOST does not contain a vector- 
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encoded protein domain), a lox site is placed alter (ic, 
downstream of) Ihc start of transcription in the host vector; 
thig is easily accomplished using synthetic oligonucleotides 
comprising the desired lox site. In designing the oligonucle- 
otide comprising the lox site, care is takea to avoid intro- 5 
ducing an ATG or start codon that might initiate translation 
inappropriately. 

To generate expression vectors intended to generate a 
fusion protein between a vector-encoded protein domain and 
the protein of interest (encoded by the gene of interest 1Q 
contained within the pUNI vector), care is taken to place the 
lox sile in the correct reading frame such thai 1) an -open 
reading frame is maintained through the lox site on pHOST 
and 2) the reading frame in the lox site on pHOST is in frame 
with the reading frame found on the lox site contained within 
the pUNI vector. In addition, the oligonucleotide comprising 1 5 
the lox site on pHOST is designed to avoid the introduction 
of in-frame stop codons. The gene of interest contained 
within the pUNI vector is cloned in a particular reading 
frame so as to facilitate the creation of the desired fusion 
protein. 21 

The modification of several expression vectors is pro- 
vided below to illustrate the creation of suitable pHOST 
vectors. In each case, the general strategy involved the 
generation of a linker containing a lox site by annealing two ^ 
complementary oligonucleotides. 'I "be annealed oligonucle- 
otides form a linker having sticky ends which arc compatible 
with ends generated by restriction enzymes whose sites arc 
conveniently located in the parental expression vector (eg., 
within the polylirikcr of the parental expression vector). ^ 

a) Modification of the pGEX-2TKcs Procaryotic Expres- 
sion Vector 

pGEX-ZFKcs is an expression vector active in tC, coti 
cells which is designed for inducible, intracellular expres- 
sion of genes or gene fragments as fusions with Gst. 35 
pGEX-2TKcs contains the BPTG -inducible tac promoter 
. (P,^) and was derived from pGEX-2TK (Pharmacia 
Biotech) as follows. The polylinker sequence of pGEX- 
2TK, 5'-GGATCCCCGGGAATTC-3' (SEQ ED NO:2), was 
replaced with the following sequence: 5'-GGAT ^ 
CGCATATGCCCATGGCTCGAGGAXCCGAAJTC-S* 
(SEO ID N03) to generate the pGEX-2TKcs vector. 

A linker containing a lox P sile was generated by annealing 
the following oligonucleotides: 5 -CATG G CTAXAACT 
TCGTATAGCATACATTATACGAA GTTATG-3' (SEQ ID 45 
NO:4) and 5 '-G ATCCATAACTTCGTATAATGTATGC 
TATACG AAGTTATAG C-3* (SEO ID NO:5). When 
annealed, these two oligonucleotides form a double-stranded 
linker having a 5' end compatible with an Ncol sticky end 
and a 3' end compatible with a Damin sticky end (FIG. 3A). 50 
pGKX-2TKcs was digested with Ncol and UamHl (WO. 3B) 
and the annealed loxP linker was inserted to form pgst-lox. 

b) Modification of the pVT,1392 Baculovirus Expression 
Vector 

pVL1392 is an expression vector that contains the poly- 55 
hedrin promoter which is active in insect cells 
(Itiarmingcn). A linker containing a loxl* sile was generated 
by annealing the following oligonucleotides: 
S'-OGCCGG ACGTC ATA ACTTOGTAT 
AGCATAOATTATACX5AAGTTATG-3 1 (SEQ ID NO:6) and eti 
5 ' -G ATC C ATAACTTC GTATAATGTATG CTATACG 
AAGTTATGACGTCC-3* (SEQ ID NO:7). When annealed, 
these two oligonucleotides form a double-stranded linker 
having a 5' end compatible with a NotI sticky end and a 3' 
end compatible with a D ami II sticky end (FIG. 4A). 65 
p VL1392 was digested with Not! and BamHl (FIG. 4B) and 
the annealed loxP linker was inserted to form pVL1392-lox. 
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c) Modification of the p( J A 1*24 Yeast Expression Vector 
pGAP24 is an expression vector that is based on the yeast 

2 jum circle and contains the constitutive GAP 
(glyceraldehyde 3-phosphate dehydrogenase) promoter 
(Pom**) which is active in yeast cells and the TRPi gene (used 
a selectable marker when the cells are grown in medium 
lacking tryptophan) [the GAP promoter is available on 
pAD23; Schilds (1990) Proc. Natl. Acad. Sci. USA 
87:2916]. A linker containing a loxP site was generated by 
annealing the following oligonucleotides: S'-TCGAGAC 
GTCATAACTTCGTATAG C ATACATTATACG AAGTTA 
TGC-3' (SEO ID NO:8) and S'-GGCCVJCAl'AACTTCG'lA 
•lAAlXJ'l^'GClAJACGAAG'n'AlGACXilX:-^ (SEQ ID 
NO: 9). When annealed, these two oligonucleotides form a 
double-stranded linker having a 5 1 end compatible with a 
Xhol sticky end and a 3' end compatible with a NotI sticky 
end (FIG. 5A). pGAP24 was digested with XhoT and Noll 
(KIG. 5B) and the annealed loxP linker was inserted U> form 
pGAP24-lox. 

d) Modification of the pGAI /1 4 Yeast Expression Vector 
pGAM4 is a yeast ccnlromcric expression vector that 

contains the GAI , promoter (I'gaO* which is induced by the 
presence of galactose in the medium, and the TRPI gene. A 
linker containing a loxP sile was generated by annealing 
together the oligonucleotides listed in SEQ II) NOS:8 and 9. 
When annealed, these two oligonucleotides form a double- 
stranded linker having a 5* end compatible with a XhoT 
sticky end and a 3* end compatible with a NotI sticky end 
(FIG. 6 A). pGAL14 was digested with Xhol and NotI (FIG. 
6B) and the annealed loxl 1 linker was inserted to form 
pGAL14-lox. 

EXAMPLE 3 

Expression And Purification Of A Gst-Crc Fusion 
Protein 

In order to provide a source of purified C!re recomhinasc 
for the in vitro recombination of plasmids, Ihc ere gene was 
inserted into a Gst expression vector such that a fusion 
protein comprising Gst at Ihe aniino-terminal end and Cre 
recomhinasc al the carboxy-lcrminal end was produced. *l"hc 
Gst-Crc fusion protein was purified by chromatography 
using Glutathione Sep h arose 4R (Pharmacia). 

The cre gene was isolated by polymerase chain reaction 
(PGR) amplification using the plasm id pBS39 (U.S. Pal. No. 
4,959,317). U.S. Pal. Nos. 4,683,195, 4,683,202 and 4,965, 
188 cover PCR methodology and are Incorporated herein by 
reference. The primers used in the PCR were designed to 
introduce an Ncol site at the first ATG in the cre open 
reading frame. The PCR product was cloned into a TA 
cloning vector (PCR0.1; Invitrogen) and then was subd- 
loocd as an NcoI-EcoRI fragment into pGEX-2TKcs (Ex. 2) 
to generate pQL123. The ligation products were used to 
IransCurm DH5ci cells and the desired recombinant was 
isolated and used to transform BL21(DE3) cells 
(Invitrogen). 

The nucleotide sequence of the Gst -Cre coding region 
within pOL123 is listed in SEO ID NO: 10. The amino acid 
sequence of the fusion protein expressed by pQL123 is listed 
in SEQ ID NO:ll. 

To express the Gst-Crc fusion protein, DL210DE3) cells 
containing the pQL123 pLasmid were grown at 37° C. in LB 
containing 100 /zg/ml ampicillin until the OD^qo reached 0.6. 
Expression of the fusion protein was then induced by the 
addition of 1PTG to a final concentration of 0.4 mM and the 
cells were allowed to grow overnight at 25° C. Following 
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induction, I he bacterial cells were pelleted by ccnlri (ligation 
at 5,000xg at 4° CI and the supernatant was discarded. A cell 
lysate was prepared as follows. Cells harvested from 0.5 liter 
of culture were suspended in 35 ml of a solution containing 
20 mM Tris-IICl, pll 8.0, 0.1M NaCl, 1 mM EDTA, 03% 
Nonidet P-40, 5 fig/ml of each of leupeptin, antrpain, apro- 
tinin and 1 mM PMSF at 4° C. The cells were incubated for 
10 min oa ice and then disrupted by soni cation (3x15 sec 
bursts) using a sonicator (Ultrasonic Heat Systems Model 
200R) at full power. The Lysate was then clarified by 
ccotrifugatioo at 12,000 rpm using a SS34 rotor (SorvaU). 

The Gst-Ore fusion protein was affinity purified from the 
cell lysate by chromatography on Glut at bio oe Scpharosc 4D 
(Pharmacia) according to the manufacturer's instructions. 
The protein concentration of Gst-Oc was deterrnined by 
Bradford analysis (BioRad). 

Aliquots of the cell lysate before and after chromatogra- 
phy on Glutathione Scpharosc 4K were applied In an SOS- 
PAGE gel. Following electrophoresis, the get was stained 
with Cnomassie blue. The stained gel is shown in FIG. 7. In 
FIG. 7, lanes 1 and 2 contain the cell lysate before and after 
chromatography, respectively. The arrowhead indicates the 
Gst-Cre fusion protein. The migration of the molecular 
weight protein markers is indicated to the left of lane 1. The 
results shown in FIG. 7 demonstrate the purification of the 
Gsl-Orc fusion protein. This fusion protein was shown to be 
functional (i.e., capable of mediating recombination 
between lux sites) in the in vitro recombination assay 
described below. 

EXAMPLE 4 

In Vitro Recombination Using The Univcctor 
Plasmid Fusion System 

*Ihc Univcctor Plasmid Fusion System permits the in vitro 
recombination of two plasm ids. FIG. 8 provides a schematic 
showing the strategy employed for in vitro recombination. 
pA represents a generic pUNI vector which contains a loxP 
site, a kaaamycin resistance gene and the conditional R6K 
origin that is only functional in E. cali strains expressing the 
IIII protein (e.g., E. coli strains BW18815, BW19094, 
BW20978, BW20979. BW2I037, BW21038). pB represents 
a generic pHOST vector which contains a loxP site, an 
ampicuUin resistance gene and a Col El origin of replication. 
pAB represents the fused plasmid which results from the 
Crc-xncdiaLcd fusion of pA and pB. 

lb illustrate the in vitro recombination reaction, pUNI-5 
(a pUNI vector which differs from pUNI-10 only in that 
pUNI-5 retains the Ncol site in the Kr*E> gene and -contains 
a different polylinkcr) was employed as pA and pQL 103, an 
a mp icfllin -resistant plasmid containing a loxP site and the 
(2olEl origin, was employed aspK. In a total reaction volume 
of 20/d, 0.2 ^g of eachpUNl-5 (pA) andpQU.03 (pB) were 
mixed in a buffer containing 50 mM Tris-HCI (pH 7S), 10 
mM MgC.lj, 30 mM NaCl and 1 mg/ml BSA. The amount 
of purified Gsl-Oc (Bx. 3) was varied from 0 to 1 -0/<g. The 
reactions were incubated at 37° C. for 20 minutes and then 
the reactions were placed at 70° C. for 5 min.. to inactivate 
the Gst-Cre protein. Five microliters of each reaction' mix- 
ture were used directly to transform competent DllSa cells 
(Cads treated). The transformed cells were plated onto 
LB/Amp (100 A/g/ml amp) and UB/Kan (40 /zg/ml kan) 
plates and the number of ampicillin resistant (Ap<&) and 
kanamycin -resistant (Kn®) colonics were counted, llic 
results are summarized in Table 1. 
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% of Ibtal 


Gst-Cre (^/tcaOtoo) 


Ap tt Colonies 


Ko R Colonics 


Kn a /Ap a 


0 


2.6 x 10* 


0 


0 


0-01 


1.Vx 10* 


571 


3 


0.05 


1.1 X 10* 


6H2 


6.2 


0J 


1.5 x 1C 1 


502 


3.3 


0.5 


03 x 10* 


104 


3.4 


IjD 


• 0.3 x 10* 


52 


1.7 



The results shown in Table 1 demonstrate, that under 
these reaction conditions 0.05 /ig purified Gst-Cre per 20 /d 

15 reaction yields the most efficient rate of plasmid fusion. 
Plasmid DNA was isolated from individual kanamycin- 
rcsistant colonics (using standard mini-prcp plasmid DNA 
isolation protocols) and subjected to restriction enzyme 
digestion to determine the structure of the fused plasmids. 

w This analysis revealed that plasmid DNA isolated from the 
kanamycin-rcsistant colonies represented a dimcr created by 
the desired fusion ofpUNI-5 and pQI .1 03 via the loxP sites. 
'Ill esc results demonstrate that the Uni vector Plasmid Fusion 
System can be used 20 to rapidly fuse two plasmids together 

2^ in vitro. 

EXAMPLE 5 

In Vitro Fusion Between ApUNI Vector 
Containing A Gene Or Interest And A Ijox- 
30 Containing Expression Vector Produces A Fused 
Vector Capable Of Expressing The Gene Of 
Interest 

In Example 4 it was demonstrated that the Univcctor 
Plasmid Fusion System can be used to rapidly fuse two 
plasmid constructs together in vitro. In this example , the 
ability of the Univector Plasmid Fusion System to fuse two 
plasmids together in a manner that places the gene of interest 
contained on the pUNI vector under the transcriptional 
^ control of a promoter contained on the pHOSTor expression 
vector in such a manner that a functional protein of interest 
is expressed from the fused construct. 

a) Insertion Of A Gene Of Interest Into The pUNI-10 
Vector 

4S 'l*hc eDN A encoding the wild-type yeast Skpl protein [Bai 
ct al. (199G) Cell 86:263] was cloned into the pUNI-10 
vector between the NdeT and BamHI sites to generate 
pUNI-Skpl; the yeast SKP] cDNA sequence is available as 
GenBank acccssioa no. U61764. Skpl is an essential protein 

50 involved in the regulation of the cell cycle in yeast. Yeast 
cells containing a temperature sensitive mutant of Skpl 
cannot grow at the no □-permissive temperature (37° C). 

b) In Vitro Fusion Reactions And Complementation 
Assays 

55 pUNI-Skpl was rccombincd with pGAl*24-lox (Ex. 2) and 
pGAf-14-lox (Ex. 2) using the in vitro reaction described in 
Ex. 4; 0.2 /ig of Gst-Cre was used per 20 fj\ reaction, 'llic 
resulting plasmid fusions were termed pG A 1*24 -Skpl and 
pGAr.l4-Skpl. pGAP24-Skpl and pGAI,14^Skpl were then 

60 transformed into the temperature sensitive (Ls) skpl -11 
mutant yeast strain Y555 (Bai et al, supra) and the trans- 
formed yeast cells were plated onto SC-tryptopban plates (to 
select for the expression of the selectable marker TRP1) and 
incubated at either a permissive (25° C.) or oon-permissive 

65 temperature (37° C); the plates which received yeast cells 
transformed with pGAL14-Skpl contained galactose. The 
ability of the transformed cells to grow at the non- 
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permissive temperature is dependent upon the expression of 
the wild-type skpl gone encoded by a properly fused pUNI- 
Skpl/cxprcssion vector construct. As a control, the yeast 
SKP1 genomic clone contained in a UUA CBN vector 
(produced by conventional cloning techniques) was used to 
transform the ts skpl-11 mutant yeast strain Y555 and the 
transformed cells were also plated at 25° C. and 37° C In 
each case, an expression vector (e.g., pRS414 or pRS4l5; 
Bai el aL, supra) Licking the SKP1 genu but containing the 
same selectable marker (ie., TRPJ) as either pGAP24-Skpl, 
pGAT/14-Skpl or URA OEN-Skpl was used to trans rarm 
■ Y555 celts as a control capable of permitting the growth of 
transformed Y555 cells on selective medium at tbe permis- 
sive temperature. 

'Iho rcsulLs demonstrated that the UUA GKN-SKP1 con- 
struct produced by conventional cloning techniques pro- 
duced a functional Skpl protein which was capable ol 
complementing the lethality of the SKP1-11 Is mutation. 
More importantly, the results demonstrated that the in vitro 
fusion reaction that created pGAP24-Slcpl and pGAL14- 
. Skpl produced constructs capable of producing functional 
Skpl; that is, Y555 cells transformed with cither pGAP24- 
Skpl or pGAL14-Skpl were capable of growth at 37° C. f a 
temperature at which the ts Skpl-11 protein produced by the 
host strain is non -functional. Expression vectors lacking the 
SKP1 cDNA were incapable of complementing the lethality 
of the skpl-11 ts mutation. 

c) Restriction Analysis, SDS-PAGH Analysis and Western 
Wot Analysis of In. Vitro Fusion Reactions 

pUNl-Skpl was recombdned with pGst-lox (Ex. 2) using 
the in vitro reaction described in Ex. 4; 0.2 fxg of Gst-Crc 
was used per 20 /d reaction. r Vbc resulting plasm id fusion 
was termed pGST-Skpl. FIG. 9 A provides a schematic 
showing the starting constructs and the predicted fusion 
construct. Five microliters of the fusion reaction mixture 
was used transform l)H5u cells as described in Ex. 4. 'llic 
transformed cells were plated onto LB/Amp/Kan plates and 
plasm id DNAwas isolated from individual Ap®Kn® colo- 
nies. The plasmid DNAs were digested with PstI followed 
by electrophoresis on agarose gels to examine the structure 
of the fused plasmids. A representative ethidium bromide- 
stained gel is shown in FIG. 9B. In FIG. 9D, lane "M" 
contains DNA size markers, lanes pUNI-Skpl and pgst-lox 
contain the starting plasmids digested with l J Stl and lanes 
1-12 contain plasmid DNA from individual AP®KrKg) colo- 
nics digested with PstI. Lanes marked with an indicate 
that these colonics contained a trimeric fusion plasmid that 
resulted from the fusion of two Gsl-lox plasmids and one 
pUNl-Skp! plasmid. The sizes of the two PstT fragments 
which result from the fusion of pUNI-Skpl and pg?4.-lox in 
kb are indicated (5.8 and 2.0 kb). Tbe results shown in FIG. 
9R demonstrate that the in vitro fusion reaction resulted in 
the production of the desired fused construct with high 
efficiency (about 83% of the plasmids in the AP<2>Kn<S> 
colonies comprised the fusion of one pUNl-Skpl vector with 
one pgst-lox vector). 

'Ihree individual Ap<©Kn® colonics were picked and 
grown in liquid cultures which were induced with IIM'G to 
examine whether the fused construct (pGst-Skpl) could 
produce the desired Gst-Skpl fusion protein. The cultures 
were grown, induced and cell extracts were prepared as 
described in Ex. 6. An aliquot of the cell lysates prepared 
from induced and uninduced cells were clectrophoresed on 
an SDS-PAGE gel and the gel was either stained with 
Coomaise blue or transferred to nitrocellulose to generate a 
Western blot. The Western blot was probed using an anti- 
Skpl polyclonal antibody (the antibody was raised against 
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the yeast Skpl using conventional methods). The resulting 
Coomassie-stai ned gel and Western blot are shown in FIGS. 
10A and I OR, respectively. 

In WG. 10A, lane **M" contains protein molecular weight 
5 markers (size in kd is indicated). Lanes marked "C* contain 
extracts prepared from E. cols containing a GST-SKPI 
construct made by conventional cloning [i.e., the SKPI 
cDNA was excised using restriction enzymes and inserted 
into pCEX-2TKcs (Ex. 2)]. Lanes 1-3 contain extracts from 
10 AP®Kn<8> cells transformed with in vitro fusion reaction 
mixtures. Extracts prepared from uninduced cells and IPTG 
induced ecus arc indicated by **— " and respectively. The 
arrowheads indicate the location of the Gst-Skpl fusion 
proteins; the Gst-Skpl fusion product generated from the 
15 pGST-SKPl fusion construct contains 15 additional amino 
acids which arc located between the Gst domain and the 
Skpl protein sequences relative to the Gst-Skpl fusion pro- 
tein expressed from the conventionally constructed GST- 
SKPI plasmid (the additional 15 amino acids are encoded by 
20 the linker comprising the loxP site; see FIG. 3). Tn FIG. 10B, 
the lane designations are the same as described for FIG. 
10A. This Western blot confirms that tbe bands indicated by 
the arrowheads in FIG. 10A represent Gst-Skpl fusion 
proteins. 

25 The results shown in FIGS. 10 A and 10B demonstrate that 
the Uni vector Fusion System can be used to create an 
expression vector which maintains the proper translational 
reading frame and permits the expression of a fusion protein 
comprising the expression vector-encoded affinity tag and 
the protein of interest. 

The above results demonstrate that the Unfvector Fusion 
System can be used to recomblne two plasmids, one con- 
taining a gene of interest but no promoter (this vector may 
35 optionally contain expression signals such as termination 
signals and/or polyadcnylation signals) and the other con- 
taining a promoter and optionally other expression signals 
(e.g., splicing signals, translation initiation codons) (and 
optionally sequences encoding an affinity domain) but lack- 
4^ ing a gene of interest, in vitro in such a manner that the 
proper translational reading frame is maintained permitting 
the expression of a functional protein from the fused plas- 
mids in the host cell. 

EXAMPI M 6 

45 

Construction Of An E. coli Strain That Inducibly 
Expresses Crc Recombinase 

An K. coli strain containing a crc gene under the control 
So °f an inducible promoter, termed the QLB4 strain, was 
constructed as follows. The ere gene was placed under the 
transcriptional control of the inducible lac promoter by 
inserting the ere ORF into a derivative of pNN402 [Elledge 
et al. (1 991) Proc. Natl. Acad. Sci. USA 88:1 731 ]; pNN402 
55 was modified In contain a lac promoter. This construct was 
then crossed onto lambda phage (e.g., Xgtll) using conven- 
tional techniques. The recombinant lambda phage carrying 
the lac-cre gene was integrated into the chromosome otE. 
coli strain JM107 to generate the OLD4 strain. 
6(i Expression of Cre recombinase was induced by growing 
QLB4 cells at 37° C. until an OD^ 0 of 0.6 was reached. The 
culture was then split into 2 parts and IPTG was added to one 
part to a final concentration of 0.4 mM. As a control, the 
BNN132 strain (ATCC 47059; Elledge et al (1991), supra) 
65 which contains the cre gene under the transcriptional control 
of the endogenous crc promoter was treated as described for 
the 0LB4 strain. Cell extracts (total protein) were prepared 
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from all Tour samples (QI,H4*1ITG and HNN132xirrO) In addition to placing Ihc cre ORK undeT ihc control or a 

and examined for expression or (>e recombinase by Western lightly controlled promoter, (!rc expression can he lightly 

blotting analysis. The Western blot was probed using a rabbit controlled by placing the etc gene on a plasmid containing 

polyclonal anti-Crc antibody (Novagcn) as the primary a temperature-sensitive (ts) rcplicoo (e.g., rep pSClOl 1 *). 

antibody and a goat anti-rabbit IgG horseradish peroxidase 5 When the ere gene is carried on a ts replication plasmid, Cre 

conjugate (Amersbamn) as the secondary antibody accord- will be expressed during the transformation of the host cell 

ing to the manufacturer's instructions. FIG. 11 shows a (because the host cell containing the ts plasmid containing 

Western blot containing extracts prepared from (shown left the cre gene was maintained at the permissive temperature) 

to rigfrl) BNNL23 cells grown in the absence of Il'lXi ("C) but will be absent following recombination of the pUNI and 

and QI.K4 cells grown in the absence ("OI-B4-") and pHOST vectors when the host cell is grown at a temperature 

presence of IPTC ("OLB4+"), respectively. The location of non-permissive for replication of the is replicon. 

the Cre recombinase band is indicated by the arrowhead. . 

The additional bands seen on this Wesrtern blot are due to EXAMPi J2 7 

cross-reactivity of the crude (Le., not affinity purified) rabbit i n Vivo Recombination In Procaryolic Hosts Using 

anti-Ore antibody with bacterial proteins. 'i^. Univcclor Fusion System 

Western blot analysis demonstrated that Cre protein could 15 A . , . „ . r . r _ . - r _ 

not be detected in BNN123 cells grown in the presence or ^sctibed tn Example 6. ^Ir^n^a oi E. coU 

absence of IFTC. Cre protein waTdetected in QUM cells C ^^T^ S QJ f \ "I™} 32 ) w ' th * 

grown in the presence of IFTO. but not in the absence «r P 1 ^. construct and a pIIOST construct (each construct 

IKIXi, by Western blot analysis. therefore, the expression of containing a single lox site) results in the fusion of these two 

Cre recombinase in OLB4 cells greatly induced by the 20 constructs in vrvo. If the host cell used for the recombination 

presence or ll'l'G in the growth medium. By tins analysis, reaction constitutivcly expresses the Cre protein, multimeric 

the expression of Cre recombinase in OLB4 ecus is depen- forms of the fused constructs are generated. In addition to 

dent upon the induction of the lac-cre gene by IPTG. the methods ou Uincd above for tightly rcgu la ting the cxprcs- 

However, more sensitive functional assays indicate that the sion of the etc gene in the host cell, cells constitutivcly 

Cre protein is expressed constitutive ly at very low levels in 25 producing Ore protein can be employed with modified pUNI 

both BNN132 cells and Ql ,B4 cells in the absence of 111X3. and pHOST vectors as described in this example. The pUM 

In these functional assays, a pUNT vector (Kn®) and a construct is modified such that two different lox sites flank 

pHOST vector (Ap®) were co transformed into QT.B4 cells the kanamycin resistance gene (the modified pUNT construct 

and the transformed cells were grown on plates containing is termed pUNI-D). The two lox sites differ in their spacer 

kanamycin to select for the presence of the pUNI-pIIOST $jj regions by one or two nucleotides and for the sake of 

fusion plasm id. Plasmid ON A was isolated from individual discussion the two different lox sites are referred to as 

kanamycin-resistant colonies and subjected to restriction "loxA" and "loxB" (e.g., loxP and loxP511; "loxb" is used 

enzyme digestion to examine the structure of the plasmid in this discussion to distinguish it from the first lox site 

UNA. This analysis revealed that multiple isoforms of the termed "loxA" and docs not indicate the use of the loxB 

plasmid fusion product were present in the plasmid DNA 35 sequence found in the E. coli chromosome). Cre cannot 

isolated from any single kanamycin-resistant colony. While efficiently catalyze a recombination event between a loxA 

not limiting Ihc present invention to any particular site and a loxb due to the sequence changes located in the 

mechanism, it is believed that low level constitutive cxprcs- spacer regions between the Cre binding sites; however Cre 

sion of (>c recombinase leads to multiple fusion cvenLs can efficiently catalyze the rccombinati on between two lox A 

between the pUNI and pHOST vectors resulting in the 40 sites or two lox B sites [Hoess ct al. (1986) Nucleic Acids 

production of multimeric forms (ie., trimer, tetramer, etc.) of Res.l4:2287]. The pHOST construct is modified such that 

the fused plasmid (the desired fused plasmid is a dimer one loxAsite and one loxB site flank the selectable marker 

formed by fusion of pUNI and pHOST). 'llie multimeric gene (the modified pHOST construct is termed pHOST-D). 

plasmid fusion products would be expected to be unstable In this example, pIIOST contains the sacB gene as the 

due to the fact that the Cre protein is constitutive I y expressed 4S selectable marker (a negative selectable marker). The pres- 

in OLB4 cells. ence of the sacB gene on pIIOST-D provides a means of 

To overcome the potential problems that low level con- counter-selection as cells expressing the sacB gene are killed 

slit u live expression of the cre gene in the host cell may when the cell is grown in medium containing 5% sucrose 

cause, the expression of cre can be more lightly control led [Gay ct al. (1985) J. Dacteriol. 164:918 and (1983) J. 

as described below. In addition the approaches described so BacterioL 153:1424]. 

below, the pUNI and pIIOST vectors can be modified as FIG. 12 provides a schematic showing the strategy for in 

described in Example 7 and these modified vectors can be vivo recombination in a Cre -expressing host cell (e.g., 

fused using a host cell that constitutivcly expresses the Cre QT.B4 cells) using the pUNl-D and pHOST-D constructs, 

protein. Arrows are used to indicate the direction of transcription of 

'Ihc expression of Cre recombinase can be more lightly 55 various genes or gene segments in FIG. 12. In FIG. 12, the 

controlled by a variety of means. For example, the expres- following abbreviations are used: Ap<B> (amptcillin resis- 

sicin of the cre gene can be made conditional when express- . tance gene); Kn<8> (kanamycin resistance gene); Ori (oon- 

ing cre under the control of the lac promoter by growing the conditional plasmid origin of replication); Uri<& (the R6Ky 

host cells in medium containing glucose. The presence of conditional origin of replication); Cre (Cre recombinase); 

0.2% glucose in the growth medium virtually shuts down 6(1 GENEX(gcnc of interest). The strategy outlined in FIG. 12 

transcription' form the lac promoter. In addition, the lac is referred to as the w in vivo genc-lrap". FIG. 12 illustrates 

promoter can be modified to insert additional operator (o) ibat the second lox site (loxB) in pUNI-O (relative to the 

sites which bind the Lac repressor. Other tightly controlled design of the pUNT-10 vector) ts inserted between the 

* promoters arc known to the art [e.g., the T7 promoter which kanamycin resistance gene and the R6Ky conditional origin 

requires the expression of T7 RNA polymerase; these pro- 65 of replication. 

motors arc available on the pHT vectors (Novagcn)] and may To generate a pHOST-D construct, a commercially avail- 
be employed to control the expression of the cre gene. able expression vector containing the desired promoter (and 
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optionally enhancer) is modified as described in Ex. 2 lo 
insert the lox A site downstream of the promoter (it is not 
necessary that a commercially available expression vector 
be employed as the art is well aware of methods for the 
generation of expression vectors). Sequences encoding the 
sacB gene [Gay ct al. (1983) J. Bacteriol. 153:1424; Gen- 
Bank accession nos. X02730 and KOI 987] and the second 
lox site (loxB) are inserted downstream of the first lox site 
(loxA). 

*lbc pUNl-D and pHOST-D constructs arc co transformed • 
into QI ,K4 cells (Ex. 6) and the transformed cells arc plated 
onto l,B/Ap/Kn plates containing 5% sucrose to select for 
the desired recombinant. FTG. 12 i I hurt rates the recombina- 
tion events which will occur in the presence of Ore in the 
QI.B4 cells. First pUNI-l) and pHOST-O will fuse to form 
two dimers in which two possible double cross-over events 
can occur. These two double cross-over events are dia- 
grammed in FIG. 12. The double cross-over events will 
result in the exchange of the DNA segments that are flanked 
by loxA and loxB to produce the plasmids labelled "A" and 
"B." All plamsids which contain the sacB gene (the pIIOST- 
D, the fused plasmids and plasmid B) will be selected 
against by the presence of sucrose in the growth medium. 
'1 *hc pUNl-D construct will not be able to replicate in QLB4 
cells as these cells do not express the II protein required Pot 
replication of the R6Ky origin.* I "here fore, the only construct 
will be maintained in QT-B4 cells selected on f,B/Kn con- 
taining sucrose is the desired plasmid A in which the gene of 
interest from pUNI-15 has been placed under the transcrip- 
tional control of the promoter located on pIIOST-D. 

pUNl-10 was modified to place a second lox site, com- 
prising the loxP5l1 sequence (SBQ ll> NO:16) between the 
icanamycin resistance gene and the R6Ky conditional origin 
of replication lo create pUNl-10-D. A second lox site, 
comprising the loxPSll site, was inserted onto a loxP- 
containing expression plasmid (i.c., a pHOST vector) to 
create a pIIOST-D vector. One-half of one microgram of 
each plasmid was co transformed into competent QT,B4 cells 
and an aliquot of the transformed cells were plated onto 
TJB/Ap plates and onto LB/Ap/Kn plates containing 5% 
sucrose and the number of colonies on each type of plate 
were counted. The percentage of AP®Kn<© colonies which 
grew on sucrose-conuining plates relative to the number of 
Ap® colonies was 1% (1x10?/ 1x10 s ). Restriction enzyme 
digestion of plasmid UNA isolated from individual 
Ap<2>KW<8) colonies which grew on sucrose -containing 
plates confirmed that the desired fusions had been generated, 
'fhese results indicate that the in vivo gene trap method can 
be used. to recombine a gene of interest carried on a pUNT-D 
vector into an expression vector using host cells that con- 
sthxtively express the Cre protein. 

in addition to providing a means for rc combining a gene 
of interest carried on a pUNl-D vector into an expression 
vector using host cells that eonstilu lively express the Oc 
protein, the in vivo gene trap method provides a means to 
transfer a gene of interest contained on a linear DNA 
molecule (e.g., a l*( !R product) that lacks a selectable marker 
into an expression vector(s). The desired PCR product is 
amplified using two primers, each of which encode a dif- 
ferent lox site (a "lox A** and "loxB" site such as a loxP and 
loxP511 site). ApUNl vector is constructed that contains (5* 
to 3*) a LoxA site, a counter-selectable marker such as the 
sacB gene and a lox B site (i.e., the two different lox sites 
Hank the counter-selectable marker). This pXJNI vector also 
contains a conditional origin of replication and an antibiotic 
resistance gene as described above and in Hi. 1. The PCR 
product (loxA-amplified sequence -loxB) is recombined with 



the modified pUNI vector (which comprises loxA-counlcr- 
selectahle marker-loxB) to create a pUNT vector containing 
the PCR product which now lacks the counter-selectable 
marker; this recombination event is selected for by growing 

5 the host cells in medium which kills the host if the counter- 
selectable gene is expressed. The PCR product in the pUNT 
vector (containing 2 lox sites) can then be placed under the 
control of the desired promoter element by recombining the 
pUNl/l*CR product construct with the appropriate pHOST-D 

io vector. 

EXAMPLE 8 

The Use Of Modified LoxP Sites To Increase 
j5 Expression Of The Protein Of Interest 

The pUNI and pIIOST constructs employed in the 
Univector Plasmid Fusion System were designed such that 
plasmid fusion results in the introduction of a lox site 
between the promoter and the gene of interest. LoxP sites 

2o consist of two 13 bp inverted repeats separated by an 8 bp 
spacer region [Uoess el aL (1982) tfroc Nail. Acad. Sci. 
USA 79:3398 and U.S. PaU No. 4,959,317]. Transcripls or 
the gene of interest produced from a pUNI-pHOST fusion 
construct comprising a loxP site may have two 13 nucleotide 

25 perfect inverted repeats within the 5* untranslated region 
(UTR) which have the potential to form a stem-loop struc- 
ture (this will occur in those cases where pHOST does not 
encode an affinity domain at the amnino-terrnlnus of the 
fusion protein). It is currently believed that the ribosome 

3U scanning mechanism is the most commonly used mecha- 
nism for initiation of translation in eukaryotes (e.g., yeast 
and mammalian cells). Using this mechanism, the ribosome 
binds to the 5* cap structure of the mRNA transcript and 
scans downstream along the 5' UTR searching for the first 

35 A1X3 or translation start codon. Without limiting the present 
invention to any particular mechanism, it is possible that a 
stem-loop structure formed by the presence of a loxP 
sequence on the 5 1 U'fR of the mRNA encoding the protein 
of interest would block or reduce the efficiency ribosome 

4n scanning and thus the translation initiation step could be 
impaired. There is evidence that stem -loop structures in the 
5* UTR of particular mRNAs reduce the efficiency of 
translation in eukaryotes [see, e.g., Donahue et aL (1988) 
MoL Cell. Diol. 8:2964 and Yoon et al. Genes and Dev. 

45 (1992) 6:2463]. It is noted that no evidence suggests that the 
presence of a stem-loop structure in the coding region (as 
opposed to the 5' U'l'R) of a transcript negatively affects its 
ability to be translated. It is likely that the energy of protein 
synthesis is sufficient lo tivcrcome secondary structures 

50 present in mRNAs. Indeed the data presented in Ex. 5 shows 
that a GST-SKPl fusion construct produced using the 
Univector Fusion System (i.c, the construct contains a loxP 
site between the sequences encoding the Cst and Skpl 
domains) produced the same level of fusion protein as did a 

55 conventional construct encoding a Osl-Skpl fusion protein 
which lacks the loxP sequence. Therefore, concerns over the 
presence of a stem-loop structure caused by the presence of 
a lox sequence in a transcript encoded by a pUNI -pIIOST 
fusion construct are limited to those constructs which do not 

6() generate fusion proteins. 

If low levels of expression are observed when a geoe of 
interest is expressed from a pUNl-pIIOST fusion constructs 
comprising lox sequences that comprise perfect 13 bp 
inverted repeats (e.g., loxP), pUNI and pIIOST constructs 

65 containing mutated loxP sequences arc employed. The 
mutated loxl* sequences comprise point mutations that cre- 
ate mismatches between the two 13 bp inverted repeat 
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sequences within the loxP site which disrupt the form a lion 
of or reduce the stability of a stem loop structure. 
Specifically, two modified loxP sites were designed that 
have mismatches at different positions in the inverted 
repeats located within a loxP site. The 13 bp inverted repeats 
are binding sites for the Cre protein; thus, each loxPsite has 
two binding sites for Cre. For the purpose of discussion, 
these two binding sites are referred to as L aad R (left and 
right). The wild -type loxl* site is designed L(0)-K(0) 
wherein "0" indicates the absence of a mutation (ie., the 
wild -type sequence). Two derivatives of the wild-type loxP 
sequence were designed and termed loxl*2 and loxl'3. The 
sequence of loxl>2 (SEQ ID NO: 13), loxF3 (SEQ ID 
NO: 14), as well as the wild-type loxP sequence {SEQ ID 
NO:12) are shown in HCi. 13. Ix>xl*2 is placed on the 
pUNI-10 construct (in place of the wild-type loxP site) and 
loxP3 is placed on the pHOST construct. 

I x)xP2 has repeals designated 1 ^3 f 6)-U(0) which indicates 
that the third and sixth nucleotides of the left repeat are 
mutated; thus, a mismatch is introduced at the third and sixth 
positions between the T. and R repeats of the loxP2 site. 
LoxP3 has repeats designated L(0)-R(9) which indicates that 
the ninth nucleotide on the right repeat sequence is mutated 
to introduce a mismatch at the ninth position between the L 
and R repeats of the loxP3 site. Fusion between the loxP2 
site on the pUNl construct and the 10x1*3 site on the pHOST 
construct will generate a hybrid loxl*23 site [L(3,6)-U (9)] 
located between the promoter and the gene of interest and a 
wild-type loxPsite [I <«)-R( 0 )] al 11,(5 <M»«d junction. 'Irtux, 
the 10x1*23 site (SHQ ID NO: 15) in the 5' UTR wfll have 
three mismatches distributed al positions 3, 6 and 9 between 
the 13 nucleotide inverted rcpcaLs which arc expected to 
strongly destabilize the formation of the stem-loop structure. 
Other mutated loxP sequences suitable for disruption of the 
stem -loop structure will be apparent to those skilled in the 
art; therefore, the present invention is not limited to the use 
of the loxP2 and loxP3 sequences for the purpose of dis- 
rupting stem-loop formation on the 5' UTR of transcripts 
produced from pUNl-pIIOST fusion constructs. The suit- 
ability of any pair of mutated lox sites for use in the 
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Uni vector Fusion system may be tested by placing one 
member of the pair on a pUNl vector and the other member 
on a pHOST construct The two modified vectors arc then 
rc combined in vitro as described in Ex. 4 and the fusion 

5 reaction mixture is used to transform K. call cells and the 
trans Conned cells are plated on selective medium (e.g., on 
T.R/Amp and TJR/Kan plates) in order In determine the 
efficiency of recombination between the two mutated lox 
sites (Ex. 4). The efficiency of recombination between the 

10 two mutated lox sites is compared to the efficiency of 
recombination between two wild-type lox sites. Any pair of 
two different mutant lox sites that re combines at a rate that 
is about 5% or greater than that observed using two loxP 
sites is a useful pair of mutated lox sites for use in avoiding 

15 the formation of a stem-loop structure on the 5* UTR of the 
mRN A Uanscribed from the pUNi/pHOST fusion construct. 

It will be apparent to those skilled in the art that a similar 
strategy can be employed for the modification of fit sites 
when the FLP recombinase is employed for the recombina- 

2U tiofl event. The frt site, like lox sites, contains two 13 bp 
inverted repeats separated by an 8 bp spacer region. 

It is clear from the above that the present invention 
provides methods Cor the subeloning of nucleic acid mol- 
ecules that permit the rapid transfer of a target nucleic acid 

2S sequence (eg., a gene of interest) from nucleic acid mol- 
ecule to another in vitro or in vivo without the need to rely 
upon restriction enzyme digestions. 

All publications and patents mentioned in the above 
specification arc herein incorporated by reference. Various 

30 modifications and variations of the described method and 
system of the invention will be apparent to those skilled in 
the art without departing from the scope and spirit of the 
invention. Although the invention has been described hi 
connection with specific preferred embodiments, it should 

35 be understood that the invention as claimed should not be 
unduly limited to such specific embodiments. Indeed, vari- 
ous modifications of the described modes for carrying out 
the invention which are obvious to those skilled in molecular 
biology or related fields are intended to be within the scope 
of the following claims. 



SiOJULMCli LUTING 

( 1 ) CIliNliRAI. INKMtMATION: 

(ill ) NTIMRETl OF SRQ1 fKNCRS: IR 

( 2 ) (NCOKMATK)N K)U SliQ. l» N<>:1: 

( i ) SEQUENCE CHARACTERISTICS: 

< A > IJ<N<m t; 7221) torn pair* 

< II )TYWi: Modefe mcid 

( C ) STRANDEDNESS: double 
( D ) TOPOLOGY: circaUx 

( I I ) MOI J«:WIJ{TY Pit other nidclcAcftd 

< A ) DESCRIPTION: Afcac = "UNA™ 

(x 1 ) SKOUKNCIilMCStMIKIiON: S10O II) N<h1: 

AATTCTOTCA OCCOTTAAGT OTTCCTOTOT CACTOAAAAT TOCTTTGAGA OOCTCTAAOO 60 

aCTTCTCAOt OCOTTACATC CCIOOCTTOT TOTCCACAAC COTTAAACCT TAAAAOCTTT 120 

AAAAOCCTTA TATATTCTTT TTTTTCTTAT AAAACTTAAA ACCTTAOAOG CTATT.TAAOT 180 

TOCTOATTTA TATTAATTTT ATTOTTCAAA CATCAOAOCT TAOTACOTGA AACATOAQAO 240 
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-continued 



CTTAQTACOT TAOCCATGAO AOCTTAOTAC GTTAGCCATG AOOOTTTAGT TCOTTAAACA 300 

TOAOACCTTA OTACUTTAAA CA'I'O A(J AUCT TA(J T ACOTOA AACATOAOAO CTTAOTACOT 36 0 

ACTATCAACA OOTTOAACTO CTOATCAACA OATCCTCTAC OCOOCCOCOO TACCATAACT 420 

TCOTATACJCA TACATTATAC CAACTTATCT COAATTCCCC CCCCVCCAOA ACATA'l'COCC 48 0 

ATOGOOATCC OfflOCCOCAA TTGTTAACAfl AT CCGTCGAC OAGCTfOCTA TCAflCCTCGA 540 

CTGKKICTrC T ACJTTCJCCAC1 CCATCT (!T T <» TTTOCCCCTC CCCCOTflCCT TCCTTGACCC 600 

TGGAAGOTGC rACTCCfACT GTGCTTTCCT AATAAAATGA GGAAATTGCA TCGCATTOTC 66 0 

TCIACIT ACIGTCJ TCATTCTATT CTG CI Ci CI CI CI T CJ CI CI (IT CICi €1 CJC A OC AC A (1<: A AC (KIGOAGOATT 7 20 

GGGAAGACAA TAOCAOOCAT OCTOGGOATT CTAOAAGATC CGOCTOCTAA CAAAOCCCGA 780 

AAGGAAGCTG A CJ T TCJ CJ C T CI C TCI CCA CT CI CI CT .CAGCAATAAC TAGCATAACC CCTTOCIOdCC M4 0 

TCTAAACOOO TCTTOAGGOG TTTTTTOCTO AAAOGAGGAA CTATATCCOO ATATCCCGGG 9 00 

c;tc;cjc;c:c;aac; aactccagca tci a cja t cc cc: c;c:cjc:tcj o acici atcatccagc c:« ci ccjt c: ccc; 96 o 

gaaaacgatt ccoaagccca acctttcata gaaggcogcg gtggaatcga AATCTCGTGA 1020 

T C! C 1 C A Ci <« T T C J (i(! CCiTCG CTT Ci CI T C <1 (I T C A T TTCGAACCCC ACJ A CJ V CC C CJ C T C A Ci A A « A A C 1QH0 

TCGTCAAGAA CCCCATAGAA GGCGATGCOC TGCGAATCGG GAGCGGCGAT ACCGTAAAGC It 40 

ACGAGG AAGC GGTCAOCCCA TTCGCCGCCA AGCTCTTCAG CAATATCACG GGTAGCCAAC 1200 

GCTATGTCCT OATAGCGGTC COCCACACCC AGCCCGCCAC AGTCGATGAA TCCAGAAAAO 1260 

CGGCOATTTT CCACCATGAT ATTCGGCAAG CAGOCATCGC CATGOGTCAC GACflAGATCC 1320 

TCGCCGTCGG GCATGCOCOC CTTGAGCCTG GCGAACAGTT CGGCTOOCGC GAGCCCCTGA 1380 

TGCTCTTCGT CCAGATCATC CTGATCGACA AGACCGGCTT CCATCCGAOT ACGTGCTCGC 1440 

TCGATGCGAT GTTTCGCTTG GTGCTCGAAT GGGCAGGTAG CCGGATCAAG CGTATGCAGC 1500 

CGCCGCATTG CATCACCCAT GATGGATACT TTCTCGGCAG GAGCAAGGTG AGATGACAGG 156 0 

AGATCCTGCC CCGGCACTTC GCCCAATAGC AGCCAGTCCC TTCCCGCTTC AGTGACAACG 1620 

TCGAGCACAG CTGCGCAAOG AACGCCCGTC GTGGCCAGCC ACGATAGCCG COCTGCCTCC 1680 

TCCTGCAGTT CATTCAGGGC ACCGGACAGG TCGOTCTTGA CAAAAAGAAC CGGGCGCCCC 1740 

TGCGCTGACA OCCOGAACAC GGCGGCATCA GAGCAGCCGA TTGTCTOTTG TGCCCAGTCA 1800 

TAOCCOAATA OCCTCTCCAC CCAAGCOOCC GGAGAACCTG CGTOCAATCC ATCTTGTTCA 1860 

ATCATOCGAA ACGATCCTCA TCCTGTCTCT TGATCAGATC TTGATCCCCT GCGCCATCAO 1920 

ATCCTTOOCO OCAAGAAAGC CATCCAGTTT ACTTTOCAOO GCTTCCCAAC CTTACCAOAO 1980 

<;(;<:<;<:<: <:ca<; c: tcjc; c a att c ccjcjtt cgctt gctgtccata aaaccoccca err ctac; ct at 2040 

CCJCCATGTAA OCCCACTOCA AOCTACCTOC T1TCTCTTTO COC7TUCOTT TTCCCTTOTC 2 100 

CAOATAOCCC AGTAGCKIAC ATTCATCCGG (KIT CA(!C ACC CI T T T CJ T C) CT CI CJ ACTddCTTTC 2160 

TACOTCTTCC OCTTCCTTTA CCACCCCTTG COCCCTOAOT OCTTCCOO'CA OCCTGAAOCT 2 2 20 



( 2 ) INFORMATION FOR SEQ ID NOifc 

( I ) SRQl IHNCR CHARA1 - 1 KULM ICS: 

( a ) r JiNcmt \a hu*> p-in» 

( B ) TYTE; nvcLdc *dd 

( <: ) STKANI>ia>NI{SK: doable 

< D ) TOPOLOGY: linuir 

( 1 i ) MOLECULE TYPE: other audetc *dd 

( A ) DHSCKIPnON: fAe*c - "DNA" 

( x 1 ) SEQUENCE DESCRIPTION: SEQ ID NO:2: 



( 
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OOATCCCCOO OAATTC 16 



( 2 ) INFORMATION FOR ST50 n> NO^I: 

( 1 ) SEQUENCE CHARACTERISTICS: 
( A ) LUNCni: 3* bw p*irm 
( R ) TVFR: nocUdc K&I 
( C ) STRAKORDNRS& diwMo 
( D ) TOrOLOGY: linear 

( ] 1 ) MOLECULE TYPE: odjcr aoddc •citl 
t ( A ) OFSCRrPTTON: Afcac - "UNA" 

( v ( )NKQUI£NCK DliSCRtFTION: KltQ ID NCh3: 

GOATCOCATA TGCCCATOGC TCOAGOATCC OAATTC 36 



( 2 ) INFORMATION FOR SEO ID NO.-4: 

< 1 ) SEQUENCE CHARACTERISTICS: 
( A ) I J(N<rnii 42 bate pilm 
( B ) TYPE: avdvlc ackl 
( C ) STRANDEDNESS: *lnf$u 
( D ) TOPOLOGY tmear 

( S 1 ) MOLECULE TYPE: otbci nucleic »cld 

< A ) DESCRIPTION: fikxc = TJNA" 

( « 1 ) STCQI ITTNCPi tJTiSCRTFTKIN: SFXJ TO NO:*: 

CATGOCTATA ACTTCGTATA OCATACATTA TACCAACTTA TG 42 



< 2 ) INFORMATION FOR STCQ ID NOJfc 

( A ) I J&*GTH: 43 (km pain. 
( R )TYTTVaad«MC ■eVI 
( C ) STRANDEDNESS: «£ry£e 
( I) )Tt>KMjCXrV^ Ifaau- 



( 1 I ) MOLECirTKTYFK: dlcr nadcie k'k! 

( A ) DESCRIPTION: /doc • "DNA" 

( x 1 ) SEQUENCE DESCRITTTON: SEO ID NOtS: 

GATCCATAAC TT COTAT AAT OTATGCTATA CCAAGTTAT A GC 



< 2 ) INFORMATTOCN FOR SEO CD NOrf: 



( f ) si tuuiiw at <:iiama<: runtimes: 

< A ) LENGTH: 46 team jnia 
( B ) TYTE: ondtw »cjJ 
( C ) STRANDEDNESS: dn^o 
( I) )TOI*OMKra IWur 

"'V. 

(11) MOLECULE TYTE: uUjk auctek »cid 

( A ) LMiSOALFllON: fOeac - "UNA" 

( » I ) NKOUliNCrK DltSCKIKI'KIN: SltQ It) N<>:6: 

GOCCGGACUT CATAACTTCO TAT AG CAT AC ATT AT AC OA A GTTATCJ 



( 2 ) INFORMATION FOR SEQ ID NO:7: 

( I ) SEQI IRNCR CHARAC ; I HU IS 1 1 CR: 
( A ) I TOMCm 1": AC* Kim jmlr* 
( B ) TYTE: nvdclc add 
( <: ) STKANDIJ)NI<SS: *tnd« 
( O ) TOPOIOOY: lima* 

( 1 t ) MOLECULE TYPE: ocbex iroctdc mAA 

( A ) IMOfOtlFIION: /de*e - "UNA" 

( x 1 ) SEQUENCE DESCRIPTION: SEQ ED NO-.7: 



, J,. 
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OATCCATAAC TTCOTATAAT OTATOCTATA COAAOTTATO ACOTCC 46 



< 2 ) INFORMATION FOR STXJ ID NOJfc 

< 1 ) SEQUENCE CHABACTERISTTCS: 
( A ) IXNCTIL* 4o b**e p»b» 
( B > TVFR: naduc «ckl 
( C ) STOANDRTJNRSS: mtntfv 
( D ) TOPOLOGY (max 

( 1 I ) MOLECULE TYPE: otl« ouduk »cld 
# ( A ) DBSCRimOK; /dewc • T)NA" 

( x t ) MKOMIINCK l»{SCK(KTKtN: S4iQ tl> N(hK: 

TCOAOACOTC ATAACTTCOT ATAOCATACA TTATACOAAO TTATOC 4 6 



( 2 ) INFORMATION FOR SEO ID NOA 

( i ) SEQUENCE CHARACTERISTICS: 

(A)IJ{N<mk44bMcpin ' 
( B ) TYTE: ovdeae acid 
( C > STRANDEDNESS: am^lc 
( D ) TOCOLOGY Uxxsr 

( 1 1 ) MOLECULE TYTE: <*faa micLck *dd 

( A ) DESCRIFTSON: /doc «s "DNA™ 

( x I ) SnOUHNCft nnSCRIPTBON: STXJ ID NO:9: 

GOCCGCATAA CTTCCTATAA TCTATOCTAT ACOAAOTTAT GACOTC 4 6 



< 2 ) INFORMATION FOR STO ID NTfclO: 

< i )»UiQUliNCl£OIAttAC*HiRJStTCS: 

< A ) IJRNOTK; 1740 b.~ palm 
( R ) TYPEi nu*4e»c warf 
( C ) STRANDEDNESS: docblc 
( I) JTDKMiK.'Y Ihor 

( 1 1 ) MOl JX3 n K TVFR: c*J«^ nodolc »cVl 

( A } DESCRIPTION: AScoc — "DNA" 

(Ik) FEATURE: 

( A ) NAME/KEY: CDS 
( B ) LOCATION: L.1737 

( A i ) SEQUENCE DESCRIPTION; SEQ ID NO:lQ: 



AT O TCC CCT 

Met Ner Pro 
1 

ACT CO A CTT 

Ttir A r q Leu 



TAT O AO COC 

Ty r (J I - A r ff 
3 5 

OUT T T O GAG 

s a 

TTA AC A CAC 

I. C « T h r O I e. 

6 5 

ATC T T <; GOT 

Met 1. c a O I y 



ccja c;c:o ctt 

Oly Ala V u I 



ATA CTA OOT 

lie tern. <! I y 

S 

CTT TTG OA A 

I. « m I. c « (III. 
3 0 

OAT OAA OOT 

A«p. (lit Oly 



TTT CCC A AT 
P h « F«o A • n 



T C T ATC CCC 

S o r Met Alt 
7 0 

CICJT TCT CCA 

Oly C r * Fro 
8 S 

T "I' <! CJ A T A T T 

Lou Aip lie 
10 0 



TAT TOO AAA 
T y r T r p I. y » 



TAT CTT OAA 

Tyr I. c u O I u 
2 5 

OAT AAA TOO 

Anp I- y * T- r p 
4 t) 

CTT CCT TAT 

1. a k Pro Tyr 

5 5 

ATC ATA CCT 

tin Tic A r u 



AAA <I A CJ <: CI I" 
1. y * CI I ■ A r R 



AC A TA<: COT 
Af ji Tyr Oly 
1 0 5 



ATT A AO OOC 

lie l.y * lily 
1 0 

OAA AAA TAT 

CI I u l.y* Tyr 



COA AAC AAA 
A r 5 Asm l.y* 



TAT ATT OAT 

Tyr fie Amp 

6 a 

TAT ATA OCT 

Tyr lie A I ■ 

7 5 

OCA CJAC! ATT 

a i • o i k r i « 

9 0 

(ITT VCO A OA 

Vul Set A r r 



CTT OTO CAA 

I. e a V ■ I C! I a 
1 5 

OAA OAO CAT 

CJ I . (1 I a II I * 
3 0 

A AO TTT OAA 

I. y « P b e CI I n 
4 5 

OOT OAT CTT 

Oly A » p V.I 



OAC A A O CAC 
Asp 1. y m H I i 



TCA ATC CTT 
S o r Mot f. c > 
9 S 

ATT OCA TAT 
lie Ala T y t 
110 



CCC 4 8 

Pro 



TTO 9 6 

I. e u 



TTO 14 4 

I. e a. 



AAA 19 3 

l.y - 



AAC 2 4 0 

R 0 

CAA 2HK 

a i u 



AOT 3 3 6 

Sot 



( 
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AAA OAC TTT OA A ACT CTC AAA OTT OAT TTT CTT AOC A AO CTA CCT OA A 

Lys A s p P b c Gl « Tlit Leo L; i V • I A • p Pbc Lot Sox Ly « Lev Pro Gl o 

11S 12 0 125 

ATO CTG AAA A TO TTC OA A GAT COT TTA TOT CAT AAA ACA TAT TTA AAT 

Met Leo L y • Met Pbc Gl d Asp Arg Leo Cy • HI i Lys Tbr Tyr Lev Asn 

130' 13 5 1 4 0 

OCT GAT CAT GTA ACC CAT CCT GAC TTC ATG TTG TAT GAC CCT CTT GAT 

Oly Asp His V ■ 1 Thx His Pro Asp Pbc Mot Loo Tyz Asp Ala Loo Asp 

145 13 0 155 160 

OTT OTT TTA TAC ATG OAC CCA ATG TGC CTO OAT GCG TTC CCA AAA TTA 

V e I Vsl Lou Tyr Met Asp Pro Met Cye Leo Asp A 1 * Pbi Pro Lye Leo 

I6S 170 ITS 

OTT TOT TTT AAA AAA COT ATT OA A OCT ATC CCA CAA ATT OAT AAO TAC 

Vsl Cy* Pbo Lys Lys Azg lie Gin Alt lie Pro Gl a I l« Asp Lys Tyr 

180 1 H5 IVO 

T TO AAA TCC AOC A AG TAT ATA OCA TOO CCT TTO C AO OOC TOG CAA GCC 

Leo Lye Ser Ser Lys Tyr lie Ale Tip Pro Leo GU G I ; Tip Oln Ale 

1 V 5 2 0 0 2 0 5 

ACG TTT GGT OGT GOC OAC CAT CCT CCA AAA TCG GAT CTG OTT CCG CGT 

Thi Pbe Oly Oly Oly Asp His Pro Pro Lys Ser Asp L « ■ V * 1 Pro Atg 

2 10 2 15 2 2 0 

OCA TCT CGT CCT CCA TCT OTT GO A TCO CAT ATO CCC ATG GCC AAT TTA 

Oly Sot At 5 Arg Ale Sor Vsl Oly Ser His Mot Pro Mel Ale Asm Leo 

2 2 5 2 3 0 .2 3 5 2 4 O 

CTO ACC OTA CAC CAA AAT TTO CCT GCA TTA CCO OTC OAT OCA ACO AOT 

Lev Thr Vsl Ills Gin. Asn Lee Pro Ale Leo I" r o Vsl Aep Ale Tbx Ser 

245 350 255 

OAT GAG GTT CGC A AG AAC CTG ATG OAC ATO TTC AGG GAT CGC CAG GCG 

Asp C I o Vsl A r ^ Lys Asn Leo Met Asp Met rhe Arg Aep Arg Gin. Ale 

2 <5 0 265 270 

TTT TCT GAG CAT ACC TOG AAA ATG CTT CTG TCC GTT TGC CGG TCG T GO 

Phc Scr <; I o Ills Thr Trp I. y s Met Lea Lew Ser Vsl Cys Arg Ser Trp 

275 280 285 

GCG GCA TGG TGC A AG TTG AAT AAC CGG AAA TOG TTT CCC GCA G A A CCT 

Alt Alt Trp Cys y * I. en Ann Asn Arc I. ye Trp Phc Pro Als Gin Pro 
290 2 9 5 300 

OA A GAT GTT COC GAT TAT CTT CTA TAT CTT CAG GCG CGC GGT CTG GCA 

<; I u Asp Vsl Arg Asp Tyr I. en I. e n Tyr I. en Gin Ale At<> <! 1 y I. en Alo 

305 310 315 320 

OTA AAA ACT ATC CAO CAA CAT TTG OOC CAG CTA AAC ATO CTT CAT COT 

V.I l.y* Thr lie C I n Gin tils t.cn C I 7 Glu I. e n Asa Met I. en Hit Ar G 

325 330 335 

COO TCC OOO CTO CCA COA CCA AOT OAC AOC AAT OCT OTT TCA CTG GTT 

Arc. Ser C I y t. « ■ ¥ 9 a Ar C Pru Ser Asp. Ser A » e Aim V - I Ser I. e » Vsl 

3 4 0 3 4 5 3 5 0 

ATO COO COO ATC COA AAA OA A AAC OTT OAT OCC GOT OA A COT OCA AAA 

Mel Ar 8 Arg 'lie Arg I. y ft Gin Ann Vsl Aep Als Gly Cll • Arg Als I. y * 

3 5 5 3 6 0 365 

CAO OCT CTA OCC TTC CAA CGC ACT OAT TTC GAC CAG OTT COT TCA CTC 

C I • Als l.eu Als Phe Glw Arg Thr Asp Plie Asp Gin Vsl Arg Ser 1. eu 
370 375 3 8 0 

ATG CAA AAT AOC OAT CGC TGC CAG GAT ATA CGT AAT CTC GCA TTT CTG 

Me 1 fl I 1 Amn Sor Asp Arg Cys Ole Asp I I a Arg A • • I. « ■ Als Pbe I. ou 

3A5. 390 3 9 5 400 

GCG ATT OCT TAT AAC ACC CTG TTA C <i T ATA GCC OA A ATT GCC AGO ATC 

Gly Tie Alw Tyr Asa Tltr 1. uu I. c « Arg Tlo Als Gin. Tie Als Arg I I o 

405 410 415 - 

ACG GTT AAA CAT ATC TCA CCT ACT CAC OCT O G CI ACA ATC TTA ATC CAT 

Arg Vsl Lye Asp lie Sor Arg Tbr Asp Oly Oly Arg Met Loo Clc His 
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ATT OOC AG A ACO AAA ACQ CTO OTT AGC ACC GCA GOT GTA GAG A AG OCA 134 4 

lie G I y A r x Tbr Lj> Tfar Lew Val Ser Tfar Ala CI 7 Val Cln L71 Ala 

4 J S 440 4 4 5 

CTT AGC CTG OGO OTA ACT AAA CTO OTC OAO COA TOO ATT TCC OTC TCT 139 2 

Lev Scr Lev Gl ; Val Tbr L;i Leu Vil Cln Arjc Trp lie Ser Vat Ser 
450 455 460 

GOT OTA OCT OAT GAT CCO A A T AAC TAC CTO TTT TGC COO OTC A OA AAA 1440 

Oly Val Ala Asp Asp Pro Aaa A ■ a Tyr Lev fhc C;i Atg Val Aig L71 

4 6 3 47Q 47 5 4 R 0 

A AT GOT OTT OCC OCO CCA TCT OCC ACC AOC CAO CTA T CA ACT COC OCC 14 8ft 

Asa OI7 Val Ala Ala Pro Ser Ala Tbr Ser Ola Lee Ser Tbr Arg Ala 

4KS 4 9 G 49S 

CTG OA A OGG ATT TTT OAA OCA ACT CAT CGA TTO ATT TAC OOC OCT AAO 1S36 
Leo Gle Gly lie Phe GIs Ala Tbr Hla Aij Lee lie Tyr OI7 Ala Lye 
5 0 0 50 5 SIO 

OAT GAC TCT GGT C AG AO A TAC CTG OCC TOO TCT GO A CAC ACT OCC COT 1584 
Aap Aep-Sor Gl 7 OU Aig Tyr Lev Ala Tip Ser Oly Hia Ser .Ala Arg 

5 15. 5 2 l> 525 

GTC OGA GCC GCG COA GAT AT G OCC CGC GCT OGA OTT TCA ATA CCG OAO 163 2 

Val Ol j Ala Ala Aig Aap Met Ala Aig Ala Gty Val Ser lie Pro Ols 
530. S3S 540 

ATC ATG CAA GCT OCT GGC TOG ACC A AT GTA A AT ATT GTC ATO AAC TAT 168 0 

tie Met Ol n Ala Oly Oly Trp Tbr Aid Val Aaa lie Val Met Asm Tyr 

S4S 5 5 0 555 560 

ATC COT AAC CTO GAT AGT CAA ACA OGO OCA ATO CTG COC CTO CTO OAA 1728 
tie Arg Aaa Lev Asp Ser Civ Tbr Oly Ala Met Val Arg Lev Leo CI o 

5 0 5 570 575 

GAT GGC OAT TAG 1740 
Aap Gly Aap 

( 2 ) INFORMATION FOR SEQ ID NOOli 

( I ) SEQUENCE CHARACTERISTICS: 

( A ) IJSNGTHi 579 amino M&k 
( B ) TYPE: amino acid 
( || ) lOHOIXKIY: linear 

( 1 1 ) MOLECULE TYPE: protein 

( y 1 )KH(2UHNCK DIMCRtPTION: SliQ I!) NO-.I1: 

Met Scr Pro lie Leo Gly Tyr Trp Ly» lie Lya Gly Lev Val Gla Pro 
1.5 10 15 

Tbr Arjt Lot* Lea Leo. Gl u Tyr Lea Olu Ol « Lya Tyr Ola Gla Hla Leu 
2 0 25 3 0 

Tj» tilt. Ar C A»p Ol* Oly Aap l.y* Trp Ar c Aaa l.y • I. y * Phe (11«t I. cm 
3 5 4 « 4 5 

Oly Leo Olv Pbe Fro Aan Leo Pro Tyr Tyr lie Asp Oly Aap Val Lya 

5 0 5 5 * II 

Lev Tbr Gin Ser Mel Ala lie lie A r * Tyr lie Ala Aap Lya Hla Aid 

65, 70 75 80 

Mel l.ta Gly Oly C y a P r m 1. y a Cln Arg Ala Gl ■ rio Ser Mai I. em Olu 
8 5 9 0 9 5 

Oly Ala VmI I. c ■ Aap lie A r ^ Tyr Oly Val S o r A r A lie Ala Tyr Ser 
10 0- 105 110 

l.ya Aap Phe C I a Thr I. en I. ya Val Aap Phe I. o a Ser l.y 1 l.eu Pro Ola 
115 12 0 12 5 

Met Lee Lya Met Pbe Ol o Aap Arg Leo Cya Hla Lya Tbr Tyr Lea Aan 
13 0 13 5 I 4 (1 

Oly Aap Hla Val Tbr His Pro Aap Pbe Mel Lee Tyx Aap Ala Lea Aap 
14S 150 155 16Q 
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V • I Va I Loo Tyi Mot Asp Pro Mot Cyi Lea Asp Ala P h « Pro L y • Leo 
165 170 175 

Val C; i Pbe L y s Ly « Arg lie Gin Ala lie Fro Ola lie Asp Ly i Tyi 
180 165 190 

l.ca I. y s Sor Sor ' • y » Tyr lie Ala Trp F r n t. c a Ol « O I j Trp Ol • A I t 

.19 5 700. 2 0 3 

Thr Phe (il y (My (I I y Asp Ills Pro Pro I. j * Scr Aap I. e « V • I Pro Ar g 
210 2 15 220 

Oly 5or Art Arg Ala Sot ViI Ol y Sot Bi i Mot Pro Mot Ala A a » Lao 
2 2 5 nil 235 240 

Leu Tbf Vol His GU Aao Lee Pro Ala Lev Pro Val A • p Ala Thr Scr 

245 250. 255 

Asp Gta Val Arg Ly i Aso Lou Mol Asp Mel Phe Arg Asp Arg Gin Ala 

200 265 270 

Phe S c r (! U Ills Thr Trp t. ys Mel Leu Lea Sor Vsl C y * Are Ser 'ftp 
275 280 2 8 5 

Ala Ala Trp Cya Lyi Loo Asa. Asn Aig Ly i Trp Phe Pro Ala Ol v Pro 

290 295 3 O 0 

Ota Asp Val Arg A • p Tyi Lea Lea Tyi Lea Cla Ala Aig Gl y Leo Ala 

305 3 10 515- 320 

Vsl l-y* Thr lie Cla (1 I a Ills l.es Gl y (ill I. mm Ass Met I. e a II Is Arc 

325 330 335 

Arji Ser Gly f . em Pro Arg Pro Ser Asp Ser Ass Als Vsl Ser Lea Vsl 
340 3 4 5 3 5 0 

Met Arg Arg II* A t g Lyi C 1 u A s n V,a I Asp Ala Ol y Olo Arg Ala Ly i 

355 3 6 0 3 <* 5 

Gl a Ala Leo Ala Phe Gl d Arg Thr Aap Pbe Aap Gin Val Arg Scr Leo 

370 375 3B0 

Mcl G I n Asn Sor Asp Arg <Z y n Gin Asp Tie Arg Ass ?. e u Als Phe Leu 
315 390 395 400 

Oly tie Ala Tyr Asa Thr Leu Leu Arg lie Alt Olo lie Ala Arg lie 
405 410 415 

Are Val L y * Asp lie Scr Ar; Thr Asp CI y CI y Arg Met Leu lie Ills 
420 425 4 3 0 

lie Oly Arg Thr Lyi Thr Leo Val Ser Tbr Als Oly Val Olo Lya Ala 
4 3 5 4 4 11 4. 4 5 

Leo Scr Leu Oly Val Th/ Lyi Lea Val Gl u Arg Tip lie Ser Val Scr 
450 455 460 

Gly Val Alt* Asp Asp Pro Ass Asn Tyr l.cu Phe Cys Arg Vsl Arg I. ys 
405 470 475 480 

A • a Gly Vol Als Als Pro. Ser Als Tltr Ser CIs Lea Ser Thr Arg Ala 
4KS 4V0 495 

Leo Olo Oly lie Pbe Olo Als Tbr II It Arg Lea lie Tyr Oly Ala Lyi 

500 505 510 

Arp Asp Scr Oly Ola Arg Tyr Leo Als Trp Scr. Gly His Sor Als Arg 
SIS 520 525 

Vsl Gly Als Als Arg Asp Mel Ala Arg Als Gly Vsl Ser lie Pro Glu 
530 535 540 

lie Met G I ii Als Gly Gly Trp Th r A ■ « Vsl Ass lie Vsl Mul Ass Tyr 

545 550 555. 560 

lie Arg Aso Leo Asp Sct Glu, Thr Gly Als Met Vsl Arg Leu t.ea (I I a 
565 570 575 



Asp Gly Asp 



45 
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( 1 ) 1NMJRMA11UN FOR S13CJ U> NO:12: 

< i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 34 b**c pifas 
( U ) TY Mi: mdelc »dd 
( C. ) STRAKORITKRSS: dmblu 
( I) )TOrOrXXTY: Ihicur 

( 1 I )MOIJ«nflJITYrib other iraddc add 

( A ) DESCRIPTION: /doc - "DNA" 

( x I ) SEQUENCE DESCRIPTION: SEQ ID NChli 

ATAACTTCOT ATAOCATACA TTATACOAAC TTAT 3 4 



( 7 ) tNWMtMATION H>H SlttJ II > NO: 1 3: 

< 1 ) SEOUENCE CHARACTERISTICS: 
( A ) LENGTH: 34 base p*ir» 
( R ) TV Hit: hkJm acid 
( C ) STRANDEDNBSS: docblc 
( D ) TOrOLOCYi linear 

( 1 1 ) MO I J <Cl II JtTYPlfc «fl«^ o«dck: *c«l 

( A > DESCRIPTION: /dese = DNA" 

( x i ) SEQUiiNCK DESCRIPTION: S£Q ID NO-.13: 

ATTACCTCOT ATAOCATACA TTATA C(!AA(i TTAT 34 



( 7 ) INFORMATION TOR STCQ m NO-.I4: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LUNCT1L 34 base pain 
( B ) TYTK: nacfvte acM 
( C ) STRANDRHNKSS: dtmbU: 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: <4Ux aadcic ackl 

( A ) DESCRIPTION: /dewe - "DNA" 

( x I )XKQUIlNC.1t OlSCRIPTtON: SI'Xl II) NO-.14: 

ATAACTTCOT ATAOCATACA TTATATGAAC TTAT 3 4 



( 2 ) INFORMATION FOR SEQ ID NO: 15: 

< 1 ) SEQUENCE CHARACTERISTICS: 
( A ) I JiN<mii 34 haftepafm 
( B ) TYPE: ancteie «cU 
( C ) STRANDEDNBSS: doable 
( D ) TOPOLOGY: Umr 

( 1 I ) MOI JtCnil J{TYKI{: tiller nucleic mc'mi 

( A ) DESCRIPTION: /dcM - "DNA" 

( * I )NKOUIiNCK DltSCRlPTtON: SNO. «» HO: 15s 

ATTACCTCOT ATAOCATACA TTATATGAAC TTAT 3 4 



( 7 ) INFORMATION FOR SMQ ID NO:1G: 

< I ) SliQUliNCK CI lAUAC'lliKISllCS: 
( A ) I .KNfTTK: 34 Utem pair* 
( 11 ) TYFTt: nucleic act! 
< C ) STRANDEDNBSS: doable 
( I) )TOKHjIX;y. linear 

( 1 I ) MOI.FCUI R TYPE: ad«;r n*cU:ic mc*\ 

( A ) DESCRIPTION: /desc - "DNA" 

( * 1 ) SEQUENCE DESCRIPTION: SEQ ID NO: 16 



ATAACTTCOT A T A CI T A T A C A T TATA(!(!AA (! TTAT 
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( 2 ) INJO&MAUON IVR SEQ U> WO: 17: 

( I > SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 34 buc pahs 
( II ) mniz nvdclc *ckJ 
( C. ) STRANDRIWFSS: Otmbic 
( O ) TOrOI jCKTV: liooir 

( t I )MOIJ*:ilUiTVH£«<»iermicle»c »cW 

( A ) DESCRIPTION: /doc - TNA" 

* 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NOJ.7: 
ACAACTTCCT ATAATGTATG CTATACGAAO T T AT 

( 1 ) lNK>KMAHON H)« SICQ II) NO:IH: 

( 1 > SEQUENCE CHARACTERISTICS: 
( A ) LENGTH. J4 baoc pain 
( II ) TYPIt nt*4c£c ackl 
( C ) STOANSEDNESS: double 
( D ) TOPOLOGY: tia&u 

< 1 1 ) MOIJtCUIJtTYPIfc oll« nvicic acid 

< A ) DESCRIPTION: /dec = "DMA" 

( x 1 > SEQUENCE Di£SCttJU r l"KJN: SEQ ID NO-.lifc 

<i aac;ttc:c:ta ttctctacaa actataccaa cttc 



We claim: 

1. ApUNI vector construct comprising, in 5* to 3 1 operable 
order: 

a) a conditional origin of replication; 

b) a sequence-specific recombinase target site having a 5' 35 
and a 3' end; and 

c) a unique restriction enzyme site, said restriction 
enzyme site located adjacent to said 3' end of said 
sequence-specific recombinase target site. 

2. The vector construct of claim 1 further comprising a 40 
prokaryolic termination sequence located 3* to said unique 
restriction enzyme site. 

3. *l*hc vector construct of claim 2, wherein said prokary- 
otie termination sequence is the 'I "7 termination sequence. 

4. The vector construct of claim 1 further comprising a 45 
eukaryolic polyadenylation sequence 1 oca Led 3' and adjacent 

to said unique restriction enzyme site. 

5. The vector construct of claim 4, wherein said polyade- 
nylation sequence is selected from the group consisting of 
the bovine growth bormone polyadenylation sequence, the 50 
simian virus 40 polyadenylation sequence and the Herpes 
simplex virus thymidine kinase polyadenylation sequence. 

6. The vector construct of claim 1 further comprising a 
selectable marker gene. 

7. The vector construct of claim 6, wherein said selectable 55 
marker is selected from the group consisting of kanamycin 
resistance gene, the ampicillin resistance gene, the tetracy- 
cline resistance gene, the chloramphenicol resistance gene, 
the streptomycin resistance gene, the strAgene and the sacB 
gene. 6<i 

- 8. The vector construct of claim 1, wherein said sequence- 
specific recombinase target site is selected from the group 
consisting of loxP, loxP2, loxP3, loxP23, loxPSll, loxD, 
loxC2, loxL. loxR, loxA86, loxA117, xrt, dif, and att 

9. The vector construct of claim 1 further comprising a €5 
gene of interest inserted into said unique restriction enzyme 
site. 



10. The vector construct of claim 1, wherein said vector 
construct is pUNl-10, comprising the nucleotide sequence 
set forth in SEQ ID NO:l. 

11. A pUNI vector construct comprising, in 5' to 3* 
operable order. 

a) a conditional origin of replication; 

b) a sequence-specific recombinase target site; and 

c) a poly linker, said polyl inker located adjacent to said 3' 
end of said sequence-specific recombinase target site. 

12. A pi IO ST expression vector construct comprising in 5* 
to 3* operable order: 

a) an origin of replication; 

b) a promoter element; and 

c) a sequence-specific recombinase targcL site. 

13. The expression vector construct of claim 12 further 
comprising a selectable marker gene. 

14. A fused expression vector construct produced by the 
recombination of a pUNI vector and a pIIOST expression 
vector, said fused expression vector comprising in 5' to 3* 
operable order: 

a} a promoter element; 

b) a first sequence -specific recombinase target site having 
a 5* and a 3* end; 

c) a gene of interest joined to said 3' end of said sequence- 
specific recombinase target site such that a functional 
translational reading frame is created; 

d) a conditional origin of replication; 

e) a first selectable marker gene; 

f) a second sequence-specific recombinase target site; and 

g) an origin of replication. 

15. The fused expression vector construct of claim 14 
further comprising a second selectable marker gene. 

16. A method for the in vitro recombination of a pUNI 
vector and a pHOST expression vector to form a fused 
expression vector, said method comprising: 



( 
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. a) providing: _ . 

!) a pUNT vector construct comprising, in 5 to 3 
operable order, a conditional origin of replication, a 
first sequence-specific rccombina.se target sile and a 
first selectable marker gcoc; 5 

ii) a pHOST expression vector construct comprising, in 
5' to 3' operable order, an origin of replication, a 
promoter element and a second sequence-specific 
rccombinasc target site; and 

iii) a site-specific rccombinasc; io 
b) contacting said pUNI vector and pHOST expression 

vector constructs in vitro with said site-specific rccom- 
binasc under conditions such that said fixst and second 
constructs are recombined to form a single fused 
expression vector. 

17. *1 tie method or claim 16, wherein said pUNI vector 
construct further comprises a gene of interest located adja- 
ccnL to said 3* end of said first sequence-specific rccombi- 
nase target site and before the selectable marker gene, 
wherein the recombination of said pUNT vector and pHOST ™ 
expression vector constructs places said gene of interest 
under the Uaoscrrptiooal control of said promoter element 
within said fused expression vector. 

18. The method of claim 17, wherein said pi IOST expres- 
sion vector construct further comprises a nucleotide 25 
sequence encoding an affinity domain and the recombinalion 
of said pUNI vector and pHOSTcxrcssion vector constructs 
results in placing said gene of interest in frame with said 
sequence encoding said affinity dcimain, thereby encoding a 
fusion protein having the affinity domain located at cither the 3lJ 
amino- or carboxy-lerminus of said fusion protein. 

19. 'Itic method of claim 16, wherein said pHOST vector 
c< nsirucl further comprises a second selectable marker gene, 
s^ : d second selectable marker being different from said first 
^electable marker. 3 ' 

20. A method for the recombination o£ pUNI vector and 
pi IOST expression vector constructs to form a single fused 
expression vector in a prokaryotic host, comprising: 
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a) providing: 

i) a pUNI vector construct comprising a conditional 
origin of replication, a first sequence-specific recom- 
binase target site having a 5' and a 3' end, a unique 
restriction enzyme site, said restriction enzyme sile 
located adjacent to said 3' end of said sequence- 
specific rccombinasc target site, and a first selectable 
marker gene; 

ii) a pHOST expression vector construct comprising in 
5* to 3' operable order, an origin of replication, a 
promoter clement having a 5' and a 3* end and a 
second sequence-specific rccombinasc target site 
having a 5' and a 3' end; 

iii) a prokarvotic host cell expressing a site-specific 
rccombinasc; 

b) introducing said pUNI vector and pi IOST expression 
vector constructs into said prokaryotic host cell, 
wherein said host cell expresses a site-specific 
recombinase, under conditions such that said pUNT 
vector and pHOST expression vector constructs are 
recombined to form a single fused expression vector. 

21. *l"hc method of claim 20 further comprising growing 
said host cell containing said single fused expression vector 
under conditions which select for the presence of said single 
fused expression vector. 

22. ITic method of claim 20, wherein said pUNI vector 
construct further comprises a prokaryotic termination 
sequence. 

23. *lnc method of claim 20, wherein said pUNI vector 
construct further comprises a gene of interest inserted into 
said unique restriction enzyme site. 

24. The method of claim 20, wherein said pUNI vector 
construct further comprises a cukaryotic polyadcnylation 
sequence. 

***** 
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»^ L ^Si LLEGE 0F MEDICINE and 
BCM TECHNOLOGIES, INC., 



Plaintiffs, 



CLONTECH LABS, INC., 



Defendant. 
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IN THE DISTRICT COURT 



HARRIS COUNTY.fEXAS \ 



JUDICIAL DlSTjycVi " 




PLAINTIFFS' ORIGINAL PETITION 
**" °"*'« « BCM Technologies, Int. 

PARTIES 

I. Plaintiff BayteU, nonprofit eorpomion organized and eriatag nnte fce u„ 

M^rnir * . p*„ corpora*. te miiatltas ift ^ plKe ofbutoKs m w 

Dryden Road, Suite 900, Howon, Tea, 77030. BCMTUa »ioD, 
responsible f« licenjug B.ylor's tecteology. 



p-owned subsidiary of Baylor 



1 Defend*. cWh h , mfmlin ^ ^ ^ ^ ^ 

- .020 «. Meadow ^ ClIjfonij9 ^ Beclme ^ ^ ^ 

designated a reared eg™ fcr ^ ^ fc ^ ^ ^ f ^ 
CHI . ^ ^ ?etviw My ^ ^ ^ ^ ^ ^ ^ ^ 
s«« »i* *. ^ who M ^ _ Mii . ^ ^ ^ ^ ^ 

principaJ place of business. 



its 



9T:9T 



JCRlSDiq-ION AND VENUE 

J-*** ^ „ ta Mnlrovosy . ^ ^ . ^ m ^ ^ ^ 
<oi, e ^ Texas, ^ ^ ^ „ ^ ^ ^ ^ ^ 

Te», «, ^ ^ tamai „ wiU,!^^ ^^^^ whoIewini)ait 

in this State. 

wereamimlteimwholeoriii^mH^&mty.Ttat 

PLAINTIFF BAYLOR 

— Biytor ^ ^ ^ ^ voAiaAi)hamU)(is ^ 

— * 3S9 gtttac 2S5 ^ 12j ^ hralth ^ ^ ^ 

itsidcnt physio™ p«, fag ^ ^ „ ^ ^ ^ ^ ^ 

ft*, oflen pata, ca. smi c« ^ ^ of i(s Ta[is ^ ^ ^ 
h.sp i0l s, rmUitte nm ^ 131jM() ^ ^ ^ ^ 2 ^ ^ 
annually. 



6- « la the past fiscal year, Baylor was nmlccd number 



one among universities and 
«*. » M ^ ^ fi^ ^ ^ devetapmal , fc ^ ^ 

•* M reseat suppon of ».0 ^ more ta 70 reMrci W p*^ — 
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^CC'ON 
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of the 125 U.S. medical school, in federal research 

?• ^^^.B-yte^^co^lo^of^.,,^^ 
devest ,. ^ „ f ^ ^ ^ . ^ ^ ^ ^ ^ 

**over, ^---^^^..^ ^ 

«m of its development efforts „ fieId of s^,^ ^ ^ 

composifons and roethods for rapid subdomng of DNA which provide] a very quick and ample 
syaeu, of identifying ^ ^ fc ^ ^ ^ & ^ 

■ssued December 22, 1998 as US. Patent 5,8S1,808 frte 

of the Eledge technology on November 19 too. «„ ta „. . 

«r iysi trior to this publication, the EUedge 

technology was kept confidential by Baylor as tmte 

« »y oayior as trade secrets, and was not known to either the 

public or the industries in which it night be exploited 

LICENSING THE ELLEDGE TECHNOLOGY 
«• I» April of it* Bayfor en** UcaBine ^ ^ 

<- OM» technoiogy. As pan of those negonations. it™ neces»ry for Baylor „ t^sfer 
various W de ^ inform.., „ ^ „ ^ ^ ^ ^ ^ ^ 

*M a Confided Disclosure Agre^, ^ BCMT and a Materia! Transfer Agrees 
-* Bay,o, Subs^ B these rwo con,**, Baylor disposed ,he EUedge technology to 
dented, induding *. *«n pending patent applied coving pan' of a, ^Cogy «. 
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— ^ free ammatim m u ^ AHrodmawy ^ ^ 

(1) a uni-vcctorplasmid(pUNiio); 

(3) a ^™^eexprejsioiipi.snud( erD OLE3 «,.r„ 

and r ltfp ^^' th «Osi-cre expression pbsmid); 

(4) the ceJl line, for conducmg 0„ «»di« (BONI3 BUN15) 

« a^ rot „ ^ ^ tftte ^ ^ ^ ^ ^ 

'"'""'"SX. ""less such infonnation became publicly availably and to use toe materials 

Confidential Disclosure Agreement include: 

(HI] ^ mcrit 3X141 commercial potential thereof 

« ha* toajj . . ?Bo5 

J ^porary or permanent injunction." 6J 



(1) 

(2) 

(3) 
(4) 
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11. 



Key provisions of the Material Transfer Agreem 



ent include: 



0) 



(2) 



(3) 



and may not be oursued hv th* pp^i^. ™ ^Prasiy Prohibited 

«iU pranptly dtatae to BCM all iaveS, T^Tr^Z 

Xmpjot 10 . aotmcUiave licen* » u* the s»u fer raearch p^oS? f, fl .. 

™ s " '***»* <"mi for . license to the Bledge technology. 

licensed to a different company. 



DEFENDANT CLONTECH 
H. Upon notice that they would not be the licensee, Clontech expressed extreme 
displeasure. On or about December 18, im.^l^K^on^f^^^ 
Clontech, and two other Clontech employees stated to Baylor that Clontech had already begun 
commercial product development using the Bledge technology. 



15. Approximately one year later. Clontech launched its-Creator system for rapid 
subcloning of DNA, and fled patent applications on their 
similar to the EUedge technology disclosed by Baylor. 



system. The Clontech product is 



sea hx'on 



61 *9T 



Co q ( o 

16. Upon intaauou and beKef, CtaMft Cr^r s^tem « upon and mate 

~ of b au» «. ^ ^ fc ^ ^ ^ ^ ^ ^ ^ 

Up «* seer*, tawt, „. Elledge tectnoIojy ^ ^ ^ ^ 

Baylor and BCMT. 

COUNT I - BREACH OF CONTRACT 
■17. For this cause of action, Baylor and BCMT repeat and adopt all allegations 
contained in paragraphs I through 16 above. 

18. Defcnto cl.»«* h* bre«M k oblige (ex^ed »! 

mm uader the Confide DUCo^ Apecma ^ 

». Defend™, Clonta, has brewed fc cortnaua, oblige <«p«s*d .,4 
implied) under * 0-«*«M Disclosure AgmaM ai ^ Trajfe ^^^^ ^ 
Mr dh ihli^b Euedge «j, SKreB „ e^,^ of ^ ^ 
and for ftiling to nos^ Baylor of such desmjcUon. 

20. C tal ^ a« brcacM to obli^ m (e[preseJ 

in.p«ed> under «he Confided Disclosure Agremea „ Materia! Transfer A**™*,, by, 
i*r * faffing „ disclose . Baylc, ail ^vennon, nvad. .hrough u» «. of a. EHedgc «d= 

secrets. 

11. Defend*, domed, ha, breached ft, conned obligaion, (wpf£ue<t ^ 
l«P«ed) Oder d* Coondendal DM.. Agreeneat „, ^ ^ ' 
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the EUedge trade secrets. 

COUNT II - MISAPPROPRIATION OF TRADE SECRETS 

22. ^^Ithrough21are3pedfic a llyincorpo ra trf 

23. Baylor and BCMT are, and at all times mentioned in this petition have been, 
engaged inan effort to develop and commercialize the technology at issue in this litigation. 

the Sled* .technology were not known to the public or the industry in which they might be 
used. The EUedge trade secrets, including the Elledge technology were developed at 
considerable expense to Baylor, and they constitute*- valuable trade secrets that would give a 
competitive advantage to anyone commercially exploiting them. 

25. Clontech intentionally and wrongfully misappropriated trade secrets from Bay!or 
and BCMT for the purpose of commercially using them. 

COUNT m- CONVERSION 

26. P^hsl through 25 axe spe^^ 

27. Clontech has unlawfully asserted donunion and control over property owned by 
Baylor and BCMT by: 

a. gaining access to Baylor and BCMTs confidential and trade secret 
information and materials; and 
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b. taking Baylor andWs confidential infbr^ on , ^ ^ ^ 
materials and converting them for its own use. 

28. Upon infonnation and belief the conversion by Clontech has been willful and 
malicious, with an attempt to cause the harm alleged. 

COUNT IV - BREACH OF FIDUCIARY RELATIONSHIP 

29. P ^*l through 28 are ^ 

30. ClontechundertookarelaHonslupof^ 

whereby Clontech owed Baylor and BCMT fiduciary duties, including duties of good fahh and 
fair dealing, loyalty, and full disclosure to Baylor andBCMT. 

31. Clontech has breached its fi^idary duties to Baylor and BCMT by usmg me^ 
confidential iiifonnation, trade secrets, and materials. 

COUNT V - STATUTORY CLAIM FOR THEFT OF TRADE SECRET 

32. ^graphs 1 tr™^ 

33. Clontech intentionally and knowingly misappropriated the confidential 
information, trade secrets, and materials of Baylor and BCMT and adopted them as its own. 

34. Pursuant to * 134.003 of Texas Civil Practice & Remedies Code, and « 31.05 of 
the Texas Penal Code, Clontech should be held liable for theft of trade seer*. ^ to 

• 134.005(b) of Texas Civil Practice A Remedies Code, plaintiff, are *o entitled to attorneys, 
fees and costs. 
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^aCRY DEMAND 

3, Baylor and BCMT demand , ^ ^ . „ „ ^ h ^ ^ ^ 

laws, of statutes of the United States, or any applicable state constitution, 'law, of statute, or at 
common law. 

PRAYER FOR RELIEF 
Wherefore, Baylor and BCMTpray for relief as follows: 

(1) Injunctive Relief 

^.oorm^u^^^^^.^^^^^^^ 
a* semtrts and a person, and entities i» con Mnor p, rfc ip aU0I1 ^ %m 

derived inform*™ ta ^ ^ ^ 

*xx*°0 notebooks, conume, ^ aes . ^ ^ ^ 

-MIA and Oi) the Cr«ror ay*™ and derived inform^ wlulevtt ta 

(ineludir* « dolors, record,, ,*.,*„, M[eboobj ^ ^ 
correspondence, mail, and other materials); 

b. ^^Oontechc^e^ 

-ret,, including the EUedge technology, or any derived information or 
technology, and (U) using, feting, filing, diluting or otherwise exploiting 
the Creator system or any derived infonnatjon or technology; 
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c o * o . 

c ordering that Clomech cease and refrain from engaging in any conduct or activity 
which has the effect 0 f interfering with terminating or diminishing any of 
plaintiff contracts and/or business relationships with prospective licensees; and 

d. ordering that Clontech cease and refrain from destroying or altering (Q any all 
Elledge trade secrets, including the Elledge technology, as well as any and all 
derived information or technology, in whatever form (such as documents, records, 
laboratory notebooks, computer disks, files, correspondence, mail, and other 
materials) and (ii) the Creator system and any derived information or technology, 

- in whatever form. 

(2) Actual damages in an amount to be determined at trial 

(3) Punitive damages in an amount of at least three times the amount of actual 
damages awarded; 

(4) An equitable order requiring defendant to disgorge to plaintiffs the full amount of 
all royalties, licensing fees, sales, and profits it has collected or retained in connection with (i) 
the Elledge trade secrets and any derived infcrmanon or technology and (ii) the Creator system 
and any derived information or technology; 

(5) An assignment of all right and interest over all data, work product, intellectual 
property (including, but not limited to all patents and patent applications), personal property, 
trade secrets which are related to the Elledge trade secrets, the Creator system, or any derived 
information or technology. 
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O AO 

d* «** pa* „ chapter 38 of th. T«a, Civil Notice & Remedies C ode „d ote 
applicable Uw, 



(7) Prejudgment and post-judgment interest at the highest ratefc) allowed by law; 

(8) Costs of court; and 



(9) Such ofcoaiuifurwa relief oa 
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